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responses and effects on K + channels, may be performed in living cells, broken cells, 
isolated cell membranes, as well as in tissues and in living animals. To facilitate 
measurement of physiological GAB An receptor mediated responses, GABA B receptors 
may be co-expressed with promiscous G-proteins like e.g. Gal 6 or Gqi5 increasing G- 
protein coupling. Another possible way to increase G-protein coupling is to fuse the 
GABAb receptor with appropriate G-proteins using standard molecular techniques. In order 
to further improve readouts in Ca^-response assays, GABAb receptors may be co- 
expressed with aequorin, a photoprotein cloned from the luminescent jellyfish Aequorea 
victoria. 

In yet another aspect, the invention provides a pharmaceutical compositions comprising a 
soluble GABAb receptor further comprising one or more pharmaceutical acceptable 
carriers and/or diluents. 

15 In yet another aspect, the invention provides a method for the diagnosis or treatment of 
conditions related to GABA-dysfunction, e.g. epilepsy, psychiatric disorders such as 
depression and anxiety, cognitive dysfunction, gastroesophageal reflux disease, emesis, 
irritable bowel syndrome, dyspepsia, spasticity, arthritis, allergies, autoimmune diseases, 
neoplastic diseases, pain, and infectious diseases, comprising the use or administration of 

20 soluble forms of the GABAb receptor, such as the human GABAb receptor lc or Id or 
functionally equivalent modified forms thereof; preferably being a polypeptidfe of the 
present invention.. 

The soluble forms of the GABAb receptor can e.g. be produced by culturing a host cell 
25— harbouring a vector comprising a nucleic acid encoding the soluble GABAb receptor under 
conditions where whereby the said polypeptide is produced, the polypeptide, recovered and 
administered to a patient in need thereof. 



30 Furthermore, the level of soluble GABA B receptors in certain body fluids, e.g. serum and 
CSF, can be measured and used in diagnostics related to conditions with altered levels of 
soluble GABAb receptors in said body fluids. 
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Throughout this description the terms "standard protocols" and "standard procedures", 
when used in the context of molecular cloning techniques, are to be understood as protocols 
and procedures found in an ordinary laboratory manual such as: Current Protocols in 
Molecular Biology, editors F. Ausubel et al M John Wiley and Sons, Inc. 1994, orSambrook, 
J., Fritsch, E.F. and Maniatis, T., Molecular Cloning: A laboratory manual, 2nd Ed., Cold 
Spring Harbor Laboratory Press, Cold Spring Harbor, NY 1989. 



EXAMPLES 

EXAMPLE 1 A: Cloning and sequencing of cDNA encoding human GABAb receptor la 
and lb 

Messenger RNA from human hippocampus was obtained from Clontech (Palo Alto, CA, 
15 USA) (catalog #6578-1). First-strand cDNA synthesis reactions were performed using the 
First-strand cDNA Synthesis kit from Amersham Pharmacia Biotech (Uppsala, Sweden). 
The pd(N) 6 primer was used to prime the first-strand synthesis. The generated cDNA 
molecules were used as template in the PCR reactions described below. 

20 Specific PCR primers were designed (Table 1) based on the sequences of the rat GABAb 
receptor la and lb cDNA (Kaupmann et al. 1997, EMBL accession numbers Y10369 (SEQ 
ID NO: 44) and Y10370 (SEQ ID NO: 46)). 
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TABLE 1 

Primers used for RT-PCR on mRNA from human hippocampus 



Nr. 


Species 


Sequence 5 ' - 3 ' 


SEQ XD NO 


794 


Rat 


GTTTCTTCTCGGATCCAGCTGTGCCTG 


1 


795 


Rat 


C AGGC AC AGC TGGAT C C G AGAAGAAACT 


2 


796 


Rat 


CGGTCGACTCACTTGTAAAGCAAATGTACTCGACTCCC 


3 


797 


Rat 


ATGCGCGCCGGCAGCCAACATGCTGCTGCTGCTGCTGGTGC 


4 


831 


Rat 


CGGTCGACTCACTTGTAAAGCAAATGTACTCGACTCCCATCACAGC 


5 


838 


Rat 


ATGCGCGCCGGCAGCCAACATGCTGCTGCTGCTGCTGGTGCCTCTCTTCC 


6 


842 


Rat 


CAGGCACAGCTGGATCCGAGAAGAAACTCTGTCGGAAAGT 


7 


863 


Rat 


GGTCATCCAGCGTTGAGGTGAAGAC 


8 


864 


Rat 


GAAGGTTG C C AG ATTATACATC CGC 


9 


865 


Rat 


C CACGATGATTCGAGC ATCTTGACG 


10 


866 


Rat 


GCCTCTCACTCCC CTCATCTCC 


11 


932 


Human 


G AGTGAAGG AGGC TGGAATTG 


12 



5 cDNA fragments encoding parts of the human GAB A B receptors were amplified directly 
by PCR using the designed primers with the generated cDNA molecules as templates. All 
PCR experiments were carried out using the Perkin Elmer Taq DNA polymerase with Gene 
Amp (Roche Molecular Systems Inc., NJ, USA) with the following PCR program: +95°C 
for 1 minute, +50°C for 30 seconds, +72°C for 3 minutes, repeated 44 times and finally 

10 +72°C for 7 minutes. The following primer combinations gave PCR products with 

expected sizes: 838 and 842, 838 and 795, 797 and 865, 864 and 865, 864 and 863 which 
corresponds to the 5 '-end of the GABA B receptor la. Primer pair 932 and 831 , 932 and 
796, 794 and 83 1 gave PCR products which correspond to the 3 '-end of both GAB A B 
receptor la and lb. The primer combination 839 and 918 gave a PCR product 

1 5 corresponding to the 5 '-end of the GABA B receptor lb. 

The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 
USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
20 cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 

primers for sequencing reactions specific oligonucleotides complementary to the vector 
pGEM-T or primers complementary to the cDNA encoding the GABAb receptor were 

used. 
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Additional PCR primers were designed based on the obtained sequences encoding 
fragments of the human GAB A B receptor and additional DNA fragments encoding parts of 
the human GAB A B receptors were amplified by PCR, the PCR products subcloned and 
sequenced as described above. 



EXAMPLE IB : Cloning and sequencing of the 3' -ends of the cDNA encoding human 
GABAb receptors la and lb 

Messenger RNA from human hippocampus was obtained from Clontech (Palo Alto, CA, 
USA) (catalogue #6578-1). First strand cDNA synthesis reactions were performed using 
the First-strand cDNA Synthesis kit from Amersham Pharmacia Biotech (Uppsala, 
Sweden). The Not I-d(T)i 8 primer was used to prime the first-strand synthesis. The 
generated cDNA molecules were used as template in the PCR reactions described below. 

Specific PCR primers were designed (Table 2 ) based on the sequences of the human 
GABAb receptor la and lb cDNA obtained in Example 1 A and the EST sequence EMBL 
accession number Yl 1044. 

By homology searches in the EMBL database using the GABAb receptor cDNA sequences 
obtained in Example 1 A as query the EST sequence EMBL accession number Yl 1044 
have been found to be homologous to the 3' -end of the GABAb receptor cDNA. 



TABLE 2 

Primers used in PCR to amplify 3' ends of human GABAb receptor cDNA 



Nr 


Species 


Sequence 5'- 3' 


SEQ ID NO 


938 


Human 


GACGCITATCGAGCAGCTTC 


13 


972 


Human 


AGCCCAGAACTCACAGGGGGACAT 


14 


973 


Human 


GCTTCAAGCCAGGTACGAACTAA 


15 



cDNA fragments encoding parts of the human GABAb receptors were amplified directly by 
30 PCR using the designed primers with the generated cDNA molecules as templates. All 
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PCR experiments were carried out using the Perkin Elmer Taq DNA polymerase with Gene 
Amp (Roche Molecular Systems Inc., NJ, USA) with the following PCR program: + 95°C 
for 1 minute, +50°C for 30 seconds,+72°C for 3 minutes, repeated 44 times and finally 
+72°C for 7 minutes. The following primer combinations gave PCR products with 
5 expected sizes: 938 and 972, 938 and 973 corresponding to the 3 'end of both GAB As 
receptor la and lb cDNA. 

The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 
USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
10 sequence for all subclones were determined using a Thermo Sequenase dye terminator 
cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 
pGEM-T or primers complementary to the cDNA encoding the GAB As receptor were 
used. 

15 

EXAMPLE 1C : Cloning and sequencing of the 5' -end of the cDNA encoding human 
GAB Ab receptor lb 

20 Messenger RNA from human hippocampus was obtained from Clontech (Palo Alto, CA, 
USA) (catalogue #6578-1). Marathon cDNA amplification Kit (Clontech) was used for 
performing 573'- RACE (Rapid Amplification of cDNA Ends). Adaptor- ligated double 
stranded cDNA molecules were amplified according to the manufacturers description. The 
pd(N) 6 primer from the First-strand cDNA Synthesis kit from Amersham Pharmacia 
* 25 Biotech (Uppsala, Sweden) was used to produce the adaptor-ligated cDNA. 

A specific PCR primer was designed (Table 3) based on the sequences of the human 
GABAb receptor lb cDNA obtained in Example 1A. 
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TABLE 3 

Primers used in PCR to amplify the 5 '-ends of human GABA B receptor cDNA 



Nr. 


Species 


Sequence 5 *-3* 


SEQ ID NO 


958 


Human 


TGGCCCTCCACCGCCTCAGTCATCTCA 


16 


API 


Marathon kit 


CC ATCCTAATACG ACTC ACT AT AGGG C 


17 



5 cDNA fragments encoding part of the human GABAb receptors were amplified direcdy by 
PCR using the designed primers with the generated adaptor-ligated cDNA molecules as 
template. The PCR experiment was carried out using the Expand Long Template PCR 
System (Boehringer Mannheim GmbH, Germany) with the following PCR program: + 
94°C for 1 minute, + 94°C for 30 seconds, +60°C for 30 seconds, +68°C for 4 minutes, 
10 repeated 24 times. The primer combination API and 958 gave a PCR product which 
corresponded to the 5 '-end of the GABAb receptor lb cDNA, including 190 base pairs 
upstream the initiation codon. 

The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 
15 USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 
pGEM-T or primers complementary to the cDNA encoding the GABAb receptor were 
20 used. 

EXAMPLE ID : Cloning and sequencing of the 5'-end of the cDNA encoding human 
GABAb receptor la 

25 

Messenger RNA from human hippocampus was obtained from Clontech (Palo Alto, USA) 
(catalogue #6578-1). Marathon cDNA amplification Kit (Clontech) was used to obtain 
adaptor-ligated double stranded cDNA molecules according to the manufacturer's 
description. The pd(N) 6 primer from the First-strand cDNA Synthesis kitfrom Amersham 
30 Pharmacia Biotech (Uppsala, Sweden) was used to obtain the adaptor-ligated cDNA . 
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Specific PCR primers were designed (Table 4) based on the sequences of the human 
GABAb receptor la cDNA obtained in Example 1 and the rat GABA B receptor la cDNA 
disclosed in WO 97/46675. 

5 

TABLE 4 

Primers used to amplify S'-ends of the human GABAB receptor la cDNA 



Nr. 


Species 


Sequence 5 '-3* 


SEQ ED NO 


1033 


Human 


CTCAATCTCATAGTCCACTGG 


18 


1087 


Rat 


CCTTGAGGCCCGGGGAGAG 


19 



10 cDNA fragments encoding part of the human GABAB la receptor were amplified directly 
by PCR using the designed primers with the generated adaptor-Iigated cDNA molecules as 
template. The PCR was performed using the Perkin Elmer Taq DNA polymerase with 
Gene Amp (Roche Molecular Systems Inc., NJ, USA) with the following PCR program: 
+94°C for 1 minute, +50°C for 30 seconds,+72°C for 3 minutes; +94°C for 1 minute, +60°C 

15 for 30 seconds,+72°C for 4 minutes, repeated 34 times and finally +72°C for 7 minutes. 

The primer combination 1087 and 1033 gave a PCR product corresponding to the 5 '-end of 
the GABAb receptor 1 a cDNA , including 26 base pairs upstream the initiation codon. 

20 The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 

USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 

25 pGEM-T were used. 

A complete cDNA sequence encoding the human GABA B receptors la (SEQ ID NO: 48) 
and the human GABA B receptor lb (SEQ ID NO: 50) were obtained by aligning the 
sequences of the different fragments cloned and sequenced in Examples 1A, IB, 1C and 
30 ID. 
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EXAMPLE 2A: Cloning and sequencing of cDNA encoding canine GABAg receptor la 

5 QuickPrep Micro mRNA Purification kit (Amersham Pharmacia Biotech, Uppsala, 
Sweden) was used to isolate mRNA from canine neural tissues according to the 
manufacturers description. First-strand cDNA synthesis reactions were performed using the 
First-strand cDNA Synthesis kit from Amersham Pharmacia Biotech (Uppsala, Sweden). 
The Not-I-d(T)i8 Afunctional or pd(N) 6 primer was used to prime the first-strand synthesis. 
10 The generated cDNA molecules were used as template in the PCR reactions described 
below. 

Specific PCR primers (Table 5) were designed based on the sequences of the rat GABA B 
receptor la and lb cDNA (Kaupmann et al. 1997, EMBL accession numbers Y 10369 (SEQ 
15 ID NO: 44 and Y10370 (SEQ ED NO: 46)). 



TABLE 5 

Primers used for RT-PCR on mRNA from canine cortex 



Nr. 


Species 


Sequence 5 '-3' 


SEQ ID NO 


795 


Rat 


CAGGCACAGCTGGATCCGAGAAGAAACT 


20 


831 


Rat 


CGGT C GAC TC ACTTGTAAAGC AAATGT ACT CG AC TC C CAT C AC AGC 


21 


338 


Rat 


ATGCGCGCCGGCAGCCAACATGCTGCTGCTGCTGCTGGTGCCTCTCTTCC 


22 


840 


Rat 


CGTCAAGATGCTCGAATCATCG 


23 


841 


Rat 


CAGGGGGCTCAGAGGGTCCC 


24 


842 


Rat 


CAGGCACAGCTGGATCCGAGAAGAAACTCTGTCGGAAAGT 


25 


844 


Rat 


CGGTCGACTCACTTGTAAAGCAAATGTACTCGACTCCCATCACAGCTAAG 


26 


848 


Rat 


ACTTTCCGACAGAGTTTCTTCTCGGATCCAGCTGTGCCTG 


27 


865 


Rat 


C C AC G ATG ATTC GAGC ATCTTGAC G 


28 



20 

cDNA fragments encoding parts of the canine GABA B receptor were amplified directly by 
PCR using the designed primers with the generated cDNA molecules as templates. All 
PCR experiments were carried out using the Perkin Elmer Taq DNA polymerase with Gene 
Amp (Roche Molecular Systems Inc., NJ, USA) with the following PCR program: +95°C 
25 for 1 minute, +50°C for 30 seconds, +72°C for 3 minutes, repeated 44 times and finally 
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+72°C for 7 minutes. The following primer combinations gave PCR products with 
expected sizes: 842 and 838, 838 and 795, 838 and 865 which corresponds to the 5 '-part of 
the canine GABA B receptor cDNA . Primer pair 848 and 844, 848 and 83 1, 848 and 841, 
840 and 841 gave PCR products which correspond to the 3 '-part of the canine GABA B 
5 receptor cDNA. 

The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 
USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
10 cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 
pGEM-T or primers complementary to the cDNA encoding the GABAg receptor were 
used. 

15 

EXAMPLE 2B: Cloning and sequencing of the 3'- and 5' -ends of the cDNA encoding 
canine GAB Ab receptor 1 a 

QuickPrep Micro mRNA Purification kit (Amersham Pharmacia Biotech, Uppsala, 
20 Sweden) was used to isolate mRNA from canine nerve tissues according to the 

manufactures description. Marathon cDNA amplification Kit (Clontech, Palo Alto, CA, 
USA) were used for performing both 5 '-and 3 "-RACE. Two adaptor-Iigated double 
stranded cDNA libraries were amplified according to the manufacturers description. A 
random primer (pd(N)e) was used when amplifying the adaptor-Iigated cDNA for the 5'- 
25 RACE and the Marathon cDNA Synthesis primer (52-mer) was used when amplifying the 
adaptor ligated cDNA for the 3* -RACE. 

Specific PCR primers were designed (Table 6) based on the sequence of canine GABAb 
receptor la cDNA obtained in Example 2A. 

30 
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TABLE 6 

Primers used in PCR to amplify the 5'- and 3' -ends of canine GABAb receptor la cDNA 



Nr. 


Species 


Sequence 5 '-3' 


SEQ ID NO 


936 


canine 


CTACCGCGCAATGAACTCCTCGTC 


29 


1076 


canine 


CGAGGTGGCGTTGGGGGTCTGTGC 


30 


API 


Marathon kit 


CCATCCTAATACGACTCACTATAGGGC 


31 


AP2 


Marathon kit 


ACTCACTATAGGGCTCGAGCGGC 


32 



5 cDNA fragments encoding parts of the canine GABAb receptor were amplified by PCR 
from the adaptor-ligated cDNA using the designed primers. A number of different PCR 
programs were tested to find conditions where PCR products corresponding to GABAb 
receptor DNA were obtained. The 5'- PCR experiments were carried out using the Expand 
Long Template PCR System (Boehringer Mannheim GmbH, Germany) with the following 
10 PCR program: +94°C for 30 seconds, +72°C for 3 minutes, repeated 4 times; +94°C for 30 
seconds, +70°C for 3 minutes, repeated 4 times; and +94°C for 30 seconds, +68 °C for 
3minutes, repeated 24 times. The primer combination AP2 and 1076 gave a PCR product 
which corresponded to the 5 '-end of the GABAb receptor cDN A, including 1 14 base pairs 
upstream the initiation codon. 

15 

The 3'- PCR experiments were carried out using the Expand Long Template PCR System 
(Boehringer Mannheim GmbH, Germany) with the following PCR program: +94°C for 1 
minute; and +94°C for 30 seconds, +60°C for 30 seconds,+68 °C for 4 minutes, repeated 
29 times. The primer combination API and 936 gave a PCR fragment which corresponded 
20 to the 3 '-end of the GABAb receptor cDNA, including the poly(A) tail. 

The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 
USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
25 cycle sequencing pre-mix kit(Amersham Pharmacia Biotech, Uppsala, Sweden). As 

primers for sequencing reactions specific oligonucleotides complementary to the vector 
pGEM-T or primers complementary to GABAb receptor DNA were used*. 
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Complete cDNA sequence encoding the canine GAB A© receptor la (SEQ ED NO: 52) was 
obtained by aligning the sequences of the different fragments obtained in Example 2A and 
Example 2B. 

5 

EXAMPLE 3 A: Cloning of cDNA encoding human GAB As receptor 1c and Id from 
Jurkat cells 

A guanidinisothiocyanate/CsCl method was used to isolate total RNA from Jurkat cells. 
10 The first-strand cDNA synthesis was performed using the First-strand cDNA Synthesis kit 
from Amersham Pharmacia Biotech (Uppsala, Sweden). The pd(N>6 primer was used to 
prime the first strand synthesis. The generated cDNA molecules were used as templates in 
the PCR reaction described below. 

15 Specific PCR primers (Table 7) were designed based on the sequence of human GABAb 
receptor la and lb cDNA (Example 1), rat GABAb receptor (Kaupmann et al. 1997) and 
the EST sequence EMBL accession number Yl 1044. 

TABLE 7 

20 Primers used in RT-PCR on mRNA from Jurkat cells 



Nr 


Species 


Sequence 5'-3' 


SEQ ID NO 


938 


human 


GACGCTTATCGAGCAGCTTC 


33 


972 


human 


AGCCCAGAACTCACAGGGGGACAT 


34 


973 


human 


GCTTCAAGCCAGGTACGAACTAA 


35 


893 


rat 


GGAGCACCCCCAAGCCCCACTG 


36 


937 


human 


CTGGTTCCTCCCAATGTG 


37 


1005 


rat 


CCTCTCACTCCCCTCATCTC 


38 


1030 


human 


AAGCCAACCITCCCTGCTTCrC 


39 



cDNA fragments encoding parts of the GABAb receptor were amplified directly by PCR 
using human and rat specific primers. All PCR experiments were carried out using the 
25 Perkin Elmer Taq DNA polymerase with Gene Amp (Roche Molecular Systems Inc., NJ, 
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USA) with the following PCR program: +95°C for 1 minute; +54°C for 1 minutes, +72°C 
for 3 minutes, repeated 44 times; and finally +72°C for 7 minutes. 



The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 
5 USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 
pGEM-T or primers complementary to GABAb receptor DNA were used. 

10 

The following primer combination gave PCR products corresponding to the 3 '-end of the 
GABAb receptor cDNA: primer pairs 938 and 972; 938 and 973. Unexpectedly both these 
fragments lacked 149 base pairs, resulting in a frame shift and the insertion of a new 
termination codon. The following primer combination gave a PCR product corresponding 
15 to the 5 '-part of the GABAb receptor la cDNA: 893 and 937. The primer pairs 1005 and 
937, 1030 and 937 gave PCR products corresponding to the 5 '-part of the GABAb receptor 
lb cDNA. Also these PCR fragments lacked the same 149 base pairs which resulted in a 
frame shift and the insertion of a new termination codon. 

20 These results show that Jurkat cells contain mRNA encoding two new forms of the human 
GABAb receptor. These are designated GABAb receptor lc (SEQ ID NO: 54 and 55) 
(comprising the same 5 '-part as the GABA B receptor la) and GABA B receptor Id (SEQ ID 
NO: 56 and 57) (comprising the same S'-part as the GABAb receptor lb). These two forms 
of the GABAb receptor do not contain any of the transmembrane regions and are therefore 

25 expected to be soluble forms of the receptor. 

EXAMPLE 3B: Analysis of cDNA encoding human GABAb receptors from hippocampus 

30 Messenger RNA from human hippocampus was obtained from Clontech (Palo Alto, USA) 
(catalogue #6578-1). First strand cDNA synthesis reactions were performed using the First- 
strand cDNA Synthesis kit from Amersham Pharmacia Biotech (Uppsala, Sweden). The 
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pd(N) 6 primer was used to prime the first-strand synthesis. The generated cDNA molecules 
were used as template in the PCR reactions described below. 

Specific PCR primer was designed (Table 8) based on the sequences of the cDNAs 
5 encoding human GABA B receptors ia and lb. 

TABLE 8 

Primers used for RT-PCR on mRNA from human hippocampus 



Nr 


Species 


Sequence 5' - 3* 


SEQ ID NO 


937 


Human 


CTGGTTCCTCCCAATGTG 


40 


938 


Human 


GACGCTTATCGAGCAGCTTC 


41 



10 

cDNA fragments encoding parts of the human GAB As receptors were amplified directly by 
PCR using the designed primers with the generated cDNA molecules as templates. All 
PCR experiments were carried out using the Perkin Elmer Taq DN A polymerase with Gene 
Amp (Roche Molecular Systems Inc., NJ, USA) with the following PCR program: +94°C 
15 for 1 minute, +50°C for 30 seconds,+72°C for 3 minutes; +94°C for 1 minute, +54°C for 30 
seconds, +72°C for 3 minutes repeated 44 times; and finally +72°C for 7 minutes. The 
primer combination 938 and 937 gave a PCR product which corresponded to the expected 
size of the GABA B receptor la and lb cDNA but also a fragment a smaller size. 

20 The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 

USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 

25 pGEM-T or primers complementary to the cDNA encoding the GAB Ab receptor were 
used. 

The larger PCR fragment was found to correspond to the 3' -part of the GABAb receptor la 
and lb cDNA and the smaller fragment which lacked 149 base pairs was found to 
30 correspond to the 3' -part of the GABAb receptors lc and Id cDNA identified in Example 
3A. 
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EXAMPLE 4: Cloning and sequencing of cDNA encoding canine GABAg receptor lb 

5 cDNA encoding the canine GABA B receptor lb is isolated in a similar manner as 
described in Example 2 for receptor la. PCR primers specifically designed to be 
complementary to the 5 '-end of the cDNA encoding the rat and human GABAq receptor 
lb, together with PCR primers complementary to the 3 '-end of the cDNA encoding the 
canine GABA B receptor la, and mRNA prepared from a suitable canine tissue, are used. 

10 

EXAMPLE 5: Cloning of cDNA encoding canine GABAb receptor 1c 

Total RNA from canine liver was prepared using the RNeasy Total RNA Purification 
15 Protocols (Quiagen GmbH, Germany). The first-strand cDNA synthesis was performed 
using the First-strand cDNA Synthesis kit from (Amersham Pharmacia Biotech, Uppsala, 
Sweden). The pd(N) 6 primer was used to prime the first -strand synthesis. The generated 
cDNA molecules were used as templates in the PCR reaction described below. 

20 Specific PCR primers (Table 9) were designed based on the sequence of canine GABAb 
receptor la cDN A. 

TABLE 9 

Primers used in RT-PCR 

25 



Nr 


Species 


Sequence 5 '-3* 


SEQ ED NO 


936 


canine 


CTACCGCGCAATGAACTCCTCGTC 


42 


954 


canine 


CCTTCTTCTCCTCCTTCTTAGTGA 


43 



cDNA fragments encoding parts of the canine GABAb receptor were amplified directly by 
PCR using canine specific primers. All PCR experiments were carried out using the Perkin 
Elmer Taq DNA polymerase with Gene Amp (Roche Molecular Systems Inc., NJ, USA) 
30 with the following PCR program: +95°C for 1 minute, +54°C for 30 seconds, +72°C for 3 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 22 PCT/SE98/01947 

minutes, repeated 44 times and finally +72°C for 7 minutes. The primer combination gave 
a PCR product with a size corresponding to the GABAb receptor la but also a fragment of 
smaller size indicating the presence of a GABAb receptor lc. 

5 The PCR products were subcloned into the pGEM-T vector from Promega (Madison, 

USA). The inserts were subjected to nucleotide sequence analysis, the complete nucleotide 
sequence for all subclones were determined using a Thermo Sequenase dye terminator 
cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As 
primers for sequencing reactions specific oligonucleotides complementary to the vector 
10 pGEM-T were used. 

The smaller fragment was shown to lack 149 base pairs. This deletion caused a frame shift 
and an insertion of a new termination codon, verifying the existence of a canine GABAb 
receptor lc. 

15 

Complete cDNA sequence encoding the canine GABAg receptor lc (SEQ ID NO: 58) was 
obtained by aligning the sequences of the fragments obtained in Example 2A, Example 2B 
and Example 5. 

20 

EXAMPLE 6. Cloning, sequencing and organization of human GABAb receptor genomic 
fragments 

To determine the structural organization and sequence of the human GABAb receptor gene, 
25 human genomic DNA libraries and human genomic DNA were screened and analyzed. 
Human genomic libraries were obtained from Clontech (Palo Alto, CA, USA). The 
libraries were constructed from female leukocyte DNA (catalog # HL1 1 1 1 J), cloned into 
XEMBL-3 vector. The average size of inserts are 16 kb and the number of independent 

clones are 1.7x10^. Human genomic DNA was obtained from Clontech (catalog # 6550-1). 
30 In order to isolate recombinant phages containing exon and intron sequences of the human 
GABAb receptor gene, 48 individual bacterial plates with a diameter of 1*50 mm and 
approximately 4x10^ individual plaques per plate, were screened. The methods and 
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solutions used were as described in the Library Protocol Handbook: General Procedures for - 
the Hybridization of Lambda Phage Libraries w/DNA Probes (Clontech) with some modi- 
fications as will be apparent from the following. 

5 The experiment was carried out essentially as follows. The numbers are given per plate 
basis. A sample of the phage library diluted in 0. 1 ml sterile lambda diluent was prepared 
in order to obtain an estimated titer of 40,000 pfu (plaque forming units). A 0.6 ml LB- 
medium culture of the E. coli host strain K802 (obtained from Clontech) was infected with 
40000 pfu recombinant phages for 15 minutes at +37°C. The culture was then mixed with 
10 7 ml top agarose (6.5 g of agarose added per liter LB) and poured onto LB plates. The 

plates were incubated at +37°C for approximately 7 hours. The plates were then chilled at 
+4°C. 



Plaque hybridization experiments were as follows. Membrane filters, Colony/Plaque 
15 Screen (DuPont, Wilmington, DE, USA), were placed onto the top of the plates for 3 

minutes. For denaturation of DN A the filters were removed and floated in 0.5 M NaOH on 
a plastic wrap for 2 minutes, with the plaque side up. This step was repeated once to ensure 
efficient denaturation. Following neutralization the membrane filters were placed in 1M 
Tris-HCl pH 7.5, two times 2 minutes and allowed to dry. 

20 

To obtain probes for DNA hybridization screening of the membrane filters, a GAB Ab 
receptor cDNA clone was digested with SacII and a 479 bp fragment (base pairs 573-1051 
in the cDNA encoding human GABA B receptor la, SEQ ID NO: 48) was separated by 
agarose electrophoresis, excised and transferred to a polypropylene microcentrifuge tube. 

25 Additional probes were obtained by PCR amplification of various regions of the GABAb 
receptor cDNA (base pairs 68-486 and 2368-2863 in the cDNA encoding human GABAb 
receptor la, SEQ ID NO: 48). The isolated cDNA fragment was 32 P-labeled using 
Megaprime DNA labeling system (Amersham Pharmacia Biotech, Uppsala, Sweden) by 
the following procedure. Water was added at a ratio of 3 ml per gram of gel, and placed in 

30 a boiling water bath for 7 minutes to melt the gel and denature the DNA. A volume of 
DNA/agarose solution containing 25 ng of DNA was added to the labeling reaction, 
according to the supplier's instructions. Labeled nucleotides were removed from DNA 
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labeling reactions using MicroSpin™ G-50 Columns (Amersham Pharmacia Biotech, 
Uppsala, Sweden ). 

The DNA hybridization reaction was performed under stringent conditions according to the 
5 method described below. The filter membranes were prehybridized at +65°C for at least 1 
hour in a solution composed of 1% SDS, 1M NaCl, and 10% dextran sulfate using a 
hybridization oven (Hybaid Ltd, Ashford, UK). Following prehybridization a solution 
containing denatured herring sperm DNA of a final concentration of 100 ng/ml and the 
32 P-labeled DNA probe at a concentration <10 ng/ml (for optimal signal to background 

10 ratio) was added to the prehybridization solution and the membrane filters were incubated 
at +65°C for 10-20 hours. Following the removal of the hybridization solution the 
membrane filters were first washed in a 2xSSC (0.3M NaCI, 0.03M Na-citrate), 1% SDS 
solution two times for 5 minutes at room temperature. In the next step, the membrane 
filters were incubated +60°C two times for 30 minutes each in the same solution. In a third 

15 step, the filters were washed two times at room temperature in O.lxSSC. Finally, the 

membrane filters were placed on a sheet of filter paper with the DNA face up, and allowed 
to dry. The dried membrane filters were then exposed to X-ray films and autoradiographed. 

Of the approximately 2xl0 6 individual plaques analyzed, four hybridizing plaques were 
20 detected and isolated. These three isolates were designated #GR1, #GR12, #GR13 and 

#GR41, respectively. After several rescreening experiments, the recombinant phage DNA 
was purified using Qiagen Lambda Midi Kit (Qiagen GmbH, Germany). The purified DNA 
was digested with Sail and the fragments representing the inserts were isolated by agarose 
electrophoresis. 

25 

The sizes of the inserts were approximately isolate #GR1, 12 kb, isolate #GR12, 12 kb; 
isolate #GR13, 16kb; and isolate #GR41, 19 kb. These fragments were cloned into Sail 
digested linearized pUC19, resulting in the plasmids pAM362 (isolate #GR1), pAM363 
(isolate #GR12), pAM364 (isolate #GR13)and pAM365 (isolate #GR41). The inserts from 
30 the four plaques which hybridizes to the GAB Ab receptor cDNA probes were analyzed by 
PCR, restriction mapping and hybridization to 32 P-labeled DNA fragments representing 
various regions of the GAB A B receptor gene. 
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The cloned fragments in the plasmids pAM362, pAM363, pAM364 and pAM365 were 
characterized by restriction enzyme mapping, using EcoRI, Hindin, PstI, and BamHI. The 
approximate positions of the exons and the approximate size of the introns were analyzed 
5 and determined by PCR-based exon-exon linking and agarose gel electrophoresis. 

In order to facilitate nucleotide sequence analysis, 7 restriction sub-fragments derived from 
pAM364, 2 restriction fragments derived from pAM362, and 1 restriction sub-fragment 
derived from pAM365 were isolated and cloned into pUC19, resulting in the plasmids 
10 pAM366-pAM375. The following strategy was employed; by combining PCR primers 

located within the pUC19 sequence either upstream or downstream of the cloning site, with 
a PCR primer with defined orientation and specific for the GAB A B receptor derived 
subcloned fragment allowed the determination. 

15 The inserts in the 10 plasmids pAM366-pAM375 were subjected to nucleotide sequence 
analysis. The nucleotide sequences for all subclones were determined using a Thermo 
Sequenase dye terminator cycle sequencing pre-mix kit (Amersham Pharmacia Biotech, 
Uppsala, Sweden). As primers for sequencing reactions specific oligonucleotides comple- 
mentary to pUC19 or primers complementary to the GABA B receptor cDNA were used. 

20 

The genomic fragments cloned in the plasmids pAM362-pAM365 where shown to contain 
the complete transcribed part of the human GABAb receptor gene and extends > 3kb 
upstream of the first exon and > 2kb downstream of the last exon. The fragments cloned in 
the plasmid pAM362 were shown to contain exons 7-11, pAM363 exons 12-22, pAM364 
25 exons 1-11, and pAM365 exons 12-23 of the GABAb receptor gene (Figure 1). The 

sequence of exons 1-11 and introns 1-10 is listed in SEQ ID NO: 60 and the sequence of 
exons 12-23 and introns 12-22 is listed in SEQ ID NO: 61. 

The human GABA B receptor gene consists of 23 exons and 22 introns (Figure 1). The size 
30 of the exons range between 21 bp and 1486 bp. As can be noted from Table 10 exon/intron 
boundaries are in accordance with the AG/GT rule and conform well to the consensus 
sequence suggested by Mount et al. 1982. 
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TABLE 10 

Exon-Intron boundaries of the GABA B receptor gene, sequences at exon-intron junctions. 







5 'splice donor 


3 'splice acceptor 




Eacon 


1 


-Exon 


2 


CGAG 


GTAAGAG 


CCGCCTCTCACTTAG 


ATGT 


Bacon. 


2 


-Exon 


3 


GAAG 


GTGCATC 


« CGACTCACC CCTTAG 


GTTG 


Sxon 


3 


-Exon 


4 


TOTG 


GTGAGTA. 


CCWATCTCTCCACAG 


TCCG 


Exon 


4 


-Exon 


5 


CACSG 


GTGAGGG 


CTTTCCTGCTGCCAG 


TOAA 


Eacon 


5 


-Exon 


6 


TCAG 


GTGAGAT 


CGCACCCCTCCTCAG 


AACG 


Exon 


6 


-Exon 


7 


CAAG 


GTAGCCC 


CCTCTTGTCTTTCAG 


TOTG 


Exon 


7 


-Exon 


8 


TOTG 


GTAAGCA. « ... 


CTCCCTGCCCCACAG 


CTTT 


Exon 


8 


-Exon 


9 


TTCG 


GTGAGGA 


.TTATTC CC AC CC AAG 


ACTC 


Exon 


9 


-ExonlO 


GAAG 


GTCAGAT 


CTTTCTCTGTKGTAG 


CGCC 


ExonlO 


-Exonll 


TGAG 


GTGGART 


CTCCTCTGTATTCAG 


GTGT 


Exonll 


-Exonl2 


CATG 


GTGAGAG 


TTTTTTCCTCCCAAG 


ACAT 


Exonl2 


-Exonl3 


CTCT 


GTGAGTT 


TGTTCCTTCCCTCAG 


GGCC 


Exonl3 


-Exonl4 


CAGG 


GTTAGTA 


™..«TTGTCGTCTGCCCAG 


GTGG 


Exonl4 


-Exonl5 


ATTG 


GTGAGTG 


CC CTGTGC CATGC AG 


GAGG 


ExonlS 


-Bxonl6 


TCCG 


GTXAGTT 


CCACCTCTGCCCTAG 


TTAT 


Sxonl6 


-Exonl7 


CCAG 


GTGAGGA 


TCTCTTCCTTTCTAG 


GCCC 


Exon 17 


-Exon 18 


GAAG 


GTGAGCT 


CACATATTTATCCAG 


ACTC 


Exonl8 


-Bxonl9 


TGAG 


GTACCAC 


TYGTTTCTGC CCTAG 


ACAT 


Sxonl9 


-Exon20 


CTTG 


GTGTGTG 


CTCCTGCCATCCTAG 


GCAT 


Exon20 


-Exon21 


GGCA 


GTGAGCA 


TGTCTTTCCCTCTAG 


GTCC 


Exon21 


-Exon22 


CAAG 


GTAAGGA 


AACATTTGCCCCCAG 


ATGC 


Exon22 


-Exon23 


TGAG 


GTGCGGG 


TGCTTCTTCCTCCAG 


AAAG 



A comparison of the sequences of the different forms of the human GAB Ab receptor 
cDNA (SEQ LD NO: 48, 50, 54, and 56) and the sequence of the human GAB As gene 
(SEQ ID NO: 60 and 61) reveals that different mRNA encoding human GABAb receptors 
are formed by alternative splicing. The translational start of the GABAb receptor la is 
localized in exon 2 and the translational stop is localized in exon 23. The mRNA encoding 
GABAb receptor lb is formed by alternative splicing where part of intron 5 is retained 
together with exon 6 where the translational start of the GABAb receptor lb is derived 
from the intron sequence. The mRNA encoding GABA B receptor lc is formed by 
alternative splicing where exon 15 are removed together with introns 14 and 15 and a 
frame shift and a translational stop are generated in the sequence corresponding to exon 16. 
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The mRNA encoding GABA B receptor Id is formed when the translational start of the ~ 
GABAb receptor lb is generated together with the translational stop of the GABAb 
receptor lc. 

5 The la isoform is formed by splicing exon 5 to a cryptic splice site in the middle of exon 6. 
Transcription of lb isoform mRNA is most likely initiated from regulatory elements in 
intron 5. The ATG initiating translation of lb mRNA is located in the 5' -end of exon 6. 

Additional variants of the mRNA encoding variants of the human GABAb receptor can be 
10 derived by alternative splicing where one ore more of the exons or part of exons are 

excised in the processing of the pre-mRNA. The subsequent transcription of these mRNA 
will give rise to variants of the human GABAb receptor with potentially different 
biological and/or pharmacological activities. 

15 

EXAMPLE 7. Analysis of cDNA encoding human GABAb receptors from human brain 

Messenger RNA from human fetal brain (catalog #6525-1) and adult human brain (catalog 
#6516-1) were obtained from Clontech (Palo Alto, CA, USA). First strand cDNA synthesis 
20 reactions were performed using the first strand cDNA Synthesis kit from Amersham 
Pharmacia Biotech (Uppsala, Sweden) .The pd(N)6 primer was used to prime the first- 
strand synthesis. The generated cDNA molecules were used as templates in the PCR 
reactions described below. 

25 Specific PCR primers was designed (Table 11) based on the sequences of the rat GABAb 
receptor la and lb cDNA and human GABAb receptor la and lb cDNA 
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Primers used for RT-PCR on mRNA from human fetal brain 



Nr 


Species 


Sequence 5 '-3' 


SEQ ID NO 


838 


Rat 


ATGCGCGCCGGCAGCCAACATGCTGCTGCTGCTGCTGGTGCCTCT 
CTTCC 


62 


863 


Rat 


GGTCATCCAGCGTTGAGGTGAAGAC 


63 


864 


Rat 


GAAGGTTGCCAGATTATACATCCGC 


64 


865 


Rat 


CCACGATGATTCGAGCATCTTGACG 


65 


937 


Human 


CTGGTTCCTCCCAATGTG 


66 


1015 


Human 


CCAGTGGACTATGAGATTGAG 


67 



5 cDNA fragments encoding parts of the human GAB Ab receptors were amplified directly by 
PCR using the designed primers with the generated cDNA molecules as templates. PCR 
experiments with primers 838, 863 ,864 and 865 were carried out using the Perkin Elmer 
Taq DNA polymerase with Gene Amp (Roche Molecular System Inc., NJ, USA) with the 
following PCR program: +95°C for 1 minute, +50°C for 30 seconds,+72°C for 3 minutes, 

10 repeated 44 times and finally +72°C for 7 minutes. PCR experiments with primers 937 and 
1015 were carried out using the Expand Long Template PCR System (Boehringer 
Mannheim GmbH, Germany) with the following PCR program: +94°C for 2 minutes; 
+94°C for 10 seconds, +55°C for 30 seconds, +68°C for 3 minutes repeated 35 times; and 
finally +68°C for 7 minutes. The primer combinations 838 and 863, 864 and 863 , 864 and 

15 865, 937 and 1015 gave PCR products. 

The PCR products were subcloned into the pGEM-T vector from Promega (Madison, WI, 
USA). The inserts were subjected to nucleotide sequence analysis. The complete nucleotide 
sequence for all subclones were determined using a ThermoSequenase dye terminator cycle 
20 sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As for primers 
for sequencing reactions specific oligonucleotides complementary to the vector pGEM-T or 
primers complementary to the cDNA encoding the GABAb receptor were used. 

A number of analyzed clones isolated from fetal brain were found to lack 186 base pairs, 
25 corresponding to exon 4. This alternative splicing resulted in a cDNA (SEQ ID NO: 70) 
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encoding a protein (SEQ ID NO: 71) comprising of 899 amino acids and designated 
GABAb receptor le. 



Other clones from fetal brain lacked 368 base pairs, corresponding to exons 4, 5 and 6, 
5 resulting in a cDNA (SEQ ID NO: 72) where a frame shift and a translational stop codon 
are generated in the sequence corresponding to exon 7, encoding a protein (SEQ ID NO: 
73) comprising only 97 amino acids designated GAB A© receptor If. 

One clone lacked 207 base pairs, corresponding to exons 4 and 5, resulting in a cDNA 
10 (SEQ ID NO: 74) encoding a protein (SEQ ID NO: 75) comprising 892 amino acids 
designated GABA B receptor lg. 

Another clone had two deletions, the first comprising 186 base pairs corresponding to exon 
4, the second comprising 39 base pairs corresponding to part of exon 6, resulting in a 
15 cDNA (SEQ ID NO: 76) encoding a protein (SEQ ID NO: 77) comprising 886 amino acids 
designated GABA B receptor lh. 

Yet another clone from adult human brain was found to have a long deletion comprising 
1 194 base pairs corresponding to base pairs 319 - 1512 of the cDNA encoding human 
20 GABA B receptor 1 a. This deletion corresponds to part of exon 4, exons 5-11, and part of 
exon 12. This cDNA (SEQ ID NO: 78) encodes a protein (SEQ ID NO: 79) comprising 
563 amino acids designated GABA B receptor li. 

One clone isolated from fetal brain lacked 284 base pairs corresponding to part of exon 3 
25 and the whole exon 4, generating a frame shift and a translational stop codon in the 

sequence corresponding to exon 5. This cDNA (SEQ ID NO: 80) encodes a protein (SEQ 
ID NO: 81) comprising only 105 amino acids designated GABAb receptor lj. 

30 EXAMPLE 8. Analysis of cDNA encoding human GABAb receptors from Jurkat cells 

A guanidinisothiocyanate/CsCl method was used to isolate total RNA from Jurkat cells. 
Firsthand cDNA synthesis reactions were performed using the first strand cDNA 
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Synthesis kit from Amersham Pharmacia Biotech (Uppsala, Sweden).The pd(N)& primer 
was used to prime the first-strand synthesis. The generated cDNA molecules were used as 
templates in the PCR reactions described below. 

5 Specific PCR primers was designed (Table 12) based on the sequences of the human 
GAB A B receptor la and lb cDNA 

TABLE 12 

Primers used for RT-PCR on mRNA from Jurkat cells 

10 



Nr 


Species 


Sequence 5*-3' 


SEQ ID NO 


937 


Human 


CTGGTTCCTCCCAATGTG 


68 


1015 


Human 


CCAGTGGACTATGAGATTGAG 


69 



cDN A fragments encoding parts of the human GAB Ab receptors were amplified directly by 
PCR using the designed primers with the generated cDNA molecules as templates. The 
PCR experiment was carried out using the Expand Long Template PCR System 
15 (Boehringer Mannheim GmbH, Germany) with the following PCR program: +94?C for 2 
minutes; +94°C for 10 seconds, +55°C for 30 seconds, +68°C for 3 minutes repeated 35 
times; and finally +68°C for 7 minutes. The primer combination 937 and 1015 gave a PCR 
product. 

20 The PCR products were subcloned into the pGEM-T vector from Promega (Madison, WI, 
USA). The inserts were subjected to nucleotide sequence analysis. The complete nucleotide 
sequence for all subclones were determined using a ThermoSequenase dye terminator cycle 
sequencing pre-mix kit (Amersham Pharmacia Biotech, Uppsala, Sweden). As for primers 
for sequencing reactions specific oligonucleotides complementary to the vector pGEM-T or 

25 primers complementary to the cDNA encoding the GABAb receptor were used. 

Two clones had two deletions, the first comprising 368 base pairs corresponding to exons 
4, 5 and 6, the second comprising 151 base pairs corresponding to exon 15 where a frame 
shift and a translational stop codon are generated in the sequence corresponding to exon 7. 
30 This cDNA (SEQ ID NO: 82) encodes a protein (SEQ ID NO: 83) comprising only 98 
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amino acids designated GABAb receptor lk which is identical to the GABA B receptor If 
described above. 

Another two clones also had two deletions, the first comprising 246 base pairs 
5 corresponding to part of exon 4, exon 5 and exon 6, the second comprising 149 base pairs 
t corresponding to exon 15, generating a frame shift and a translational stop codon in the 

sequence corresponding to exon 16. This cDNA (SEQ ID NO: 84) encodes a protein (SEQ 
ID NO: 5) comprising 496 amino acids designated GABAb receptor 11. 

10 Additional variants of cDNA encoding the human GABAb receptors can be identified in a 
similar manner using PCR primers based on the sequence of the cDNA and genomic 
fragments encoding the human GABAb receptors disclosed in the present application. 

The biological activity of these variants of the human GABAb receptor can be evaluated by 
15 transfection of suitable host cells with expression vectors comprising the corresponding 
cDNA sequences and subsequent measurement of binding of labeled ligands or activation 
of the receptor or modulation of receptor function. 

20 EXAMPLE 9: Generation of antibodies 

Antibodies were raised in rabbits against four different BS A-conjugated 20 amino acids 
long synthetic peptides corresponding to selected regions of the human and canine GABAb 
receptor extracellular domain. Two were directed against a sequence common for GABAb 
25 receptor la and b (abl and ab2), one against a GABAb receptor la-specific region (al) and 
one against a GABA B receptor lb-specific sequence (bl). To allow BSA-conjugation, a 
cy stein residue was added to the N-terminus in all peptides but al, which contains an 
endogenous cystein. 

30 Peptide al: (SEQ ID NO: 49 amino acids 18-37) 

Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie He Jiis Pro Pro Trp 
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Peptide abl : (SEQ ID NO: 49 amino acids 197-216 with N-terminally added Cys) 

Cys - Glu Asp Val Asn Ser Arg Arg Asp He Leu Pro Asp Tyr Glu Leu Lys Leu lie 

His His 

5 Peptide ab2: (SEQ ID NO: 49 amino acids 271-290 with N-terminally added Cys) 

Cys - Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro 
Ser Ala 

Peptide bl: (SEQ ID NO: 57 amino acids 30-47 with N-terminally added Cys) 
10 Cys - Ser His Ser Pro His Leu Pro Arg Pro His Ser Arg Val Pro Pro His Pro Ser 

The antibodies were purified from rabbit serum by affinity chromatography using the 
corresponding immobilized peptide and subsequently used to detect expression of 
recombinant GABA B receptor isoforms on Western blots. 

15 

EXAMPLE 10: Heterologous expression of GABAb receptor isoforms in mammalian cells 

A Hindm / Sail cDNA fragment encoding the human GABAb receptor lb isoform was 
20 cloned into a BPV (bovine papilloma virus)-based expression vector containing the mMT- 
1 (murine metallothioneine) promoter. Using the calcium phosphate method, murine CI 27 
cells were co-transfected with the GABAb receptor expression construct and an expression 
plasmid containing a G418 resistance marker gene using the calcium phosphate method. 
G418 resistant clones were evaluated by Western blot analysis for expression of the 
25 approximately 100 kDa GABA B receptor lb isoform(Figure 2). Similarly, but using 

Lipofectamine (Life Technologies Inc, Rockville, MD, USA) for transfection, the human 
GABAb receptor lb isoform was expressed in human HEK-293 cells using the pCI-neo 
expression vector. The identity of the heterologously expressed receptor was verified in 
HEK-293 cells by Western blot analysis and radiologand binding experiments. 

30 

A cDNA fragment encoding the human GABAb receptor Id isoform was^cloned into a 
BPV-based expression vector containing the mMT-1 promotor. Using the calcium 
phosphate method, murine CI 27 cells were co-transfected with the GABAb receptor 
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expression construct and an expression plasmid containing a G418 resistance marker gene. 
G418 resistant clones, and concentrated medium from the same clones, were evaluated for 
GABAb receptor Id isoform expression by Western blot analysis (Figure 3). The 
experiment revealed the human GABAb receptor Id to be a secreted isoform. 

5 

'T- 

EXAMPLE 1 1 : Heterologous expression of GABAb receptor isoforms in E. coli 

A cDNA fragment encoding the human GABAb receptor Id isoform was cloned into a 
10 modified pET (Pharmacia Amersham, Uppsala, Sweden) vector downstream of a STII 

(heat stable enterotoxin II of E. coli) signal peptide and followed by a thrombin clevage site 
and a hexahistidine tag. The expression construct was subsequently used to transform the 
BL21 (DE3) £. coli strain. Western blot analysis of IPTG-induced bacteria revealed 
expression of a human GABAb receptor Id isoform of expected size(Figure 4). 

15 

In addition, the human GABAb receptor Id isoform was successfully expressed without 
fusion to a bacterial signal peptide in E. coli strain AD494 (DE3). 

20 EXAMPLE 12: Method for the screening of substances which are GABAb receptor 
antagonists or agonists 

GABAb receptor expressing cells, transgenic animals or cells and tissues derived thereof, 
are used to screen substance libraries for antagonist or agonist activities. Screening is 
25 performed as ligand binding assays or functional assays. For screening, cells and tissues are 
prepared in various ways, each uniquely suited to its purpose. Ligand binding assays are 
performed in vivo or in vitro. Functional assays exemplified by, but not limited to, Ca 4 ^- 
responses, cAMP-responses and effects on Cl~ and K + channels, are performed in living 
cells, broken cells, isolated cell membranes, as well as in tissues and in living animals. 
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1. A nucleic acid molecule encoding a human or canine GABAg receptor, or a 
functionally equivalent modified form thereof. 

2. A nucleic acid molecule according to claim 1 encoding a human GABAg receptor, or 
a functionally equivalent modified form thereof. 

3. A nucleic acid molecule according to claim 2 encoding the human GAB Ajj receptor 
la, or a functionally equivalent modified form thereof. 

4. A nucleic acid molecule according to claim 3 selected from: 

(a) a nucleic acid molecule comprising a nucleotide sequence set forth as SEQ ID 
NO: 48; 

(b) a nucleic acid molecule comprising a nucleotide sequence capable of hybridizing, 
under stringent hybridization conditions, to a nucleotide sequence complementary to 
the polypeptide coding region of a DNA molecule as defined in (a); and 

(c) a nucleic acid molecule comprising a nucleotide sequence which is degenerate as 
a result of the genetic code to a nucleotide sequence as defined in (a) or (b). 

5. A nucleic acid molecule according to claim 2 encoding the human GABAg receptor 
lb, or a functionally equivalent modified form thereof.. 

6. A nucleic acid molecule according to claim 5 selected from: 

(a) nucleic acid molecules comprising a nucleotide sequence set forth as SEQ ID NO: 
50; 

(b) nucleic acid molecules comprising a nucleotide sequence capable of hybridizing, 
under stringent hybridization conditions, to a nucleotide sequence complementary the 
polypeptide coding region of a DNA molecule as defined in (a); and 

(c) nucleic acid molecules comprising a nucleotide sequence which is degenerate as a 
result of the genetic code to a nucleotide sequence as defined in (a) or (b). 
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7. A nucleic acid molecule according to claim 2 encoding the human GAB A3 receptor — 
lc, or a functionally equivalent modified form thereof. 



8. A nucleic acid molecule according to claim 7 selected from: 

(a) nucleic acid molecules comprising a nucleotide sequence set forth as SEQ ID NO: 
54; 

(b) nucleic acid molecules comprising a nucleotide sequence capable of hybridizing, 
under stringent hybridization conditions, to a nucleotide sequence complementary the 
polypeptide coding region of a DNA molecule as defined in (a); and 

(c) nucleic acid molecules comprising a nucleotide sequence which is degenerate as a 
result of the genetic code to a nucleotide sequence as defined in (a) or (b). 

9. A nucleic acid molecule according to claim 2 encoding the human GABAq receptor 
Id, or a functionally equivalent modified form thereof.. 



10. A nucleic acid molecule according to claim 9 selected from: 

(a) nucleic acid molecules comprising a nucleotide sequence set forth as SEQ ID NO: 
56; 

(b) nucleic acid molecules comprising a nucleotide sequence capable of hybridizing, 
under stringent hybridization conditions, to a nucleotide sequence complementary the 
polypeptide coding region of a DNA molecule as defined in (a); and 

(c) nucleic acid molecules comprising a nucleotide sequence which is degenerate as a 
result of the genetic code to a nucleotide sequence as defined in (a) or (b). 

11. A nucleic acid molecule according to claim 1 encoding a canine GAB Ag receptor, or 
a functionally equivalent modified form thereof. 

12. A nucleic acid molecule according to claim 1 1 encoding the canine GAB Ag receptor 
la, or a functionally equivalent modified form thereof. 

13. A nucleic acid molecule according to claim 12 selected from 
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(a) nucleic acid molecules comprising a nucleotide sequence set forth as SEQ ID NO: ~ 
52; 

(b) nucleic acid molecules comprising a nucleotide sequence capable of hybridizing, 
under stringent hybridization conditions, to a nucleotide sequence complementary the 
polypeptide coding region of a DNA molecule as defined in (a); and 

(c) nucleic acid molecules comprising a nucleotide sequence which is degenerate as a 
result of the genetic code to a nucleic acid sequence as defined in (a) or (b). 

14 A nucleic acid molecule according to claim 1 1 encoding the canine GABAg receptor 
1c, or a functionally equivalent modified form thereof 

15. A nucleic acid molecule according to claim 14 selected from 

(a) nucleic acid molecules comprising a nucleotide sequence set forth as SEQ ID NO: 
58; 

(b) nucleic acid molecules comprising a nucleotide sequence capable of hybridizing, 
under stringent hybridization conditions, to a nucleotide sequence complementary the 
polypeptide coding region of a DNA molecule as defined in (a); and 

(c) nucleic acid molecules comprising a nucleotide sequence which is degenerate as a 
result of the genetic code to a nucleic acid sequence as defined in (a) or (b). 

16. A recombinant polypeptide encoded by a nucleotide sequence according to any one 
of claims 1 to 15. 

17. The polypeptide according to claim 16 comprising an amino acid sequence set forth 
as SEQ ID NO: 49, 51, 53, 55, 57 or 59 in the Sequence Listing. 

18. A polypeptide according to claim 16 which has been posttranslationally modified. 

19. A vector transformed with a nucleic acid molecule according to any one of claims 1 
to 15. 

20. A cultured host cell harboring a vector according to claim 19. 
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21. A process for the production of a GABAjj receptor, said process comprising 

culturing a host cell according to claim 20 under conditions whereby the said 
polypeptide is produced, and recovering the said polypeptide. 

5 

22. A method for the screening of compounds which are inhibitors of transient lower 



esophageal sphincter relaxations, said method comprising the use of a nucleic acid 
molecule encoding a GABAg receptor. 



10 23. A method according to claim 22 wherein the said nucleic acid molecule encoding a 
GABAg receptor is the nucleic acid molecule according to any one of claims 1 to 15. 

24. A method for the screening of compounds which are agonists or antagonists to a 
GAB Ag receptor, said method comprising the use of a nucleic acid molecule 



25. A method according to any one of claims 22 to 24 comprising the steps (a) 
transforming a cultured cell with a nucleic acid molecule encoding a GAB A3 

receptor, so that a GAB Ajj receptor is expressed on the surface of the cell; (b) 



compound binds to, and/or activate, the GABAg receptor. 

26. A pharmaceutical compositions comprising a soluble GABAb receptor further 
comprising one or more pharamceutical acceptable carriers and/or diluents. 



15 



according to any one of claims 1 to 15. 



20 



contacting a test compound with the said cell; and (c) determining whether the test 
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SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: ASTRA AB 

(B) STREET: Vastra Malarehamnen 9 

( C ) CITY : Soder tal j e 

( E ) COUNTRY : Sweden 

(F) POSTAL CODE (ZIP) : S-151 85 

(G) TELEPHONE: +46-8-553 260 00 

(H) TELEFAX: +46-8-553 288 20 

(I) TELEX: 19237 astra s 

(ii) TITLE OF INVENTION: New nucleotide sequences 
(iii) NUMBER OF SEQUENCES: 85 

<iv) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk. 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE; Patentln Release #1.0, Version #1.30 (EPO) 

(2) INFORMATION FOR SEQ ID NO: 1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 

( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 1: 



GTTTCTTCTC GGATCCAGCT GTGCCTG 



27 



HI 865-1 WO SEQ 



1 



SUBSTITUTE SHEET (RULE 26) 



10 



WO 99/21890 PCT/SE98/01947 



( 2 ) INFORMATION FOR SEQ ID NO : 2 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 28 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 
15 CAGGCACAGC TGGATCCGAG AAGAAACT 28 



( 2 ) INFORMATION FOR SEQ ID NO : 3 : 

20 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
{ D ) TOPOLOGY : 1 inear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 3: 
30 CGGTCGACTC AC TTGT AAAG CAAATGTACT CGACTCCC 3 8 



(2) INFORMATION FOR SEQ ID NO: 4: 

35 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



40 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 4: 
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ATGCGCGCCG GCAGCCAACA TGCTGCTGCT GCTGCTGGTG C .41 



(2) INFORMATION FOR SEQ ID NO : 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 5: 
CGGTCGACTC AC TTGTAAAG CAAATGTACT CGACTCCCAT CACAGC 



(2) INFORMATION FOR SEQ ID NO : 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 
ATGCGCGCCG GCAGCCAACA TGCTGCTGCT GCTGCTGGTG CCTCTCTTCC 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 7: 
CAGGCACAGC TGGATCCGAG AAGAAACTCT GTCGGAAAGT 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 8: 
GGTCATCCAG CGTTGAGGTG AAGAC 



( 2 ) INFORMATION FOR SEQ ID NO : 9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH : 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 9: 
GAAGGTTGCC AGATTATACA TCCGC 



(2) INFORMATION FOR SEQ ID NO : 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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5 



10 



25 



(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
CCACGATGAT TCGAGCATCT TGACG 25 

(2) INFORMATION FOR SEQ ID NO: 11; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 11: 

20 

GCCTCTCACT CCCCTCATCT CC 22 



(2) INFORMATION FOR SEQ ID NO: 12; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
30 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

■% 

35 

GAGTGAAGGA GGCTGGAATT G 21 



(2) INFORMATION FOR SEQ ID NO: 13: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 
GACGCTTATC GAGCAGCTTC 20 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 
AGCCCAGAAC TCACAGGGGG ACAT 24 

25 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

35 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 

GCTTCAAGCC AGGTACGAAC TAA 23 

40 , 

(2) INFORMATION FOR SEQ ID NO: 16: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
TGGCCCTCCA CCGCCTCAGT CATCTCA 27 



(2) INFORMATION FOR SEQ ID NO : 17 : 

15 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

25 

CCATCCTAAT ACGACTCACT ATAGGGC 27 



(2) INFORMATION FOR SEQ ID NO: 18: 



30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
35 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 

40 

CTCAATCTCA TAGTCCACTG G 21 
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(2) INFORMATION FOR SEQ ID NO: 19: — 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
CCTTGAGGCC CGGGGAGAG 19 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CAGGCACAGC TGGATCCGAG AAGAAACT 28 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 46 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 



(xi) 



SEQUENCE DESCRIPTION: SEQ ID NO: 21: 



CGGTCGACTC ACTTGTAAAG CAAATGTACT CGACTCCCAT CACAGC 



46 
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(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 0 base pairs 

(B) TYPE: nucleic acid 
CO STRANDEDNESS: single 
( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
ATGCGCGCCG GCAGCCAACA TGCTGCTGCT GCTGCTGGTG CCTCTCTTCC 

(2) INFORMATION FOR SEQ ID NO : 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
CGTCAAGATG CTCGAATCAT CG 



(2) INFORMATION FOR SEQ ID NO : 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
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CAGGGGGCTC AGAGGGTCCC 



5 (2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 base pairs 

(B) TYPE: nucleic acid 

10 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

15 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 

CAGGCACAGC TGGATCCGAG AAGAAACTCT GTCGGAAAGT 



20 (2) INFORMATION FOR SEQ ID NO : 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

25 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

30 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 26: 

CGGTCGACTC ACTTGTAAAG CAAATGTACT CGACTCCCAT CACAGCTAAG 



35 (2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 40 base pairs 
<B) TYPE: nucleic acid 
40 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
ACTTTCCGAC AGAGTTTCTT CTCGGATCCA GCTGTGCCTG 

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
CCACGATGAT TCGAGCATCT TGACG 

(2) INFORMATION FOR SEQ ID NO : 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CTACCGCGCA ATGAACTCCT CGTC 

(2) INFORMATION FOR SEQ ID NO : 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
CGAGGTGGCG TTGGGGGTCT GTGC 

(2) INFORMATION FOR SEQ ID NO : 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 31: 
CCATCCTAAT ACGACTCACT ATAGGGC 

(2) INFORMATION FOR SEQ ID NO: 32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 
ACTCACTATA GGGCTCGAGC GGC 

(2) INFORMATION FOR SEQ ID NO: 33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

<ii) MOLECULE TYPE: DNA 

5 

<xi) SEQUENCE DESCRIPTION: SEQ ID NO : 33: 
GACGCTTATC GAGCAGCTTC 



10 

(2) INFORMATION FOR SEQ ID NO: 34: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 24 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D) TOPOLOGY : 1 inear 

<ii) 

20 

(xi) 

AGCCCAGAAC TCACAGGGGG ACAT 

25 

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 23 base pairs 
30 <B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) 

35 

(xi) 

GCTTCAAGCC AGGTACGAAC TAA 

40 

(2) INFORMATION FOR SEQ ID NO: 36: 
(i) SEQUENCE CHARACTERISTICS: 



24 



23 



MOLECULE TYPE: DNA 

SEQUENCE DESCRIPTION: SEQ ID NO: 34: 



MOLECULE TYPE: DNA 

SEQUENCE DESCRIPTION: SEQ ID NO : 35: 



H1865-1 WOSEQ 13 



SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 



PCT/SE98/01947 



(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 36: 
10 GGAGCACCCC CAAGCCCCAC TG 



(2) INFORMATION FOR SEQ ID NO: 37: 

15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 
25 CTGGTTCCTC CCAATGTG 



(2) INFORMATION FOR SEQ ID NO : 38: 

30 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 38: 
40 CCTCTCACTC CCCTCATCTC 



(2) INFORMATION FOR SEQ ID NO : 39: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

5 <C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

( ii ) MOLECULE \TYPE : DNA 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 39: 

AAGCCAACCT TCCCTGCTTC TC 22 

15 (2) INFORMATION FOR SEQ ID NO : 40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

20 (C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40: 

CTGGTTCCTC CCAATGTG 18 

30 (2) INFORMATION FOR SEQ ID NO: 41: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

35 (C) STRANDEDNESS: single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41: 

GACGCTTATC GAGCAGCTTC 20 

H1865-1 WO SEQ 15 
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(2) INFORMATION FOR SEQ ID NO : 42: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2 4 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
{ D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 

CTACCGCGCA ATGAACTCCT CGTC 24 

15 

(2) INFORMATION FOR SEQ ID NO : 43 : 

(i) SEQUENCE CHARACTERISTICS: 
20 (A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 
CCTTCTTCTC CTCCTTCTTA GTGA 24 

30 

(2) INFORMATION FOR SEQ ID NO: 44: 

(i) SEQUENCE CHARACTERISTICS: 
35 (A) LENGTH: 2 883 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNESS : s ingle 
< D ) TOPOLOGY : 1 inear 

40 (ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

H1865-1 WOSEQ 16 
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(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus norwegicus 

5 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .2883 

10 (x) PUBLICATION INFORMATION: 

(A) AUTHORS: Kaupmann, K 

Huggel , K 
Heid, J 
Flor, P 

15 Bischoff, S 

Mickel, S 

McMaster, G 

Angst, C 

Bittiger, H 
20 Froestl, W 

(B) TITLE: Expression cloning of GABA-B receptors 

uncovers similarity to metabotropic glutamate 
receptors 

(C) JOURNAL: Nature 
25 (D) VOLUME: 3 86 

(F) PAGES: 239-246 

(G) DATE: 20 march-1997 



30 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

ATG CTG CTG CTG CTG CTG GTG CCT CTC TTC CTC CGC CCC CTG GGC GCT 48 
Met Leu Leu Leu Leu Leu Val Pro Leu Phe Leu Arg Pro Leu Gly Ala 
15 10 15 

35 GGC GGG GCG CAG ACC CCC AAC GCC ACC TCG GAA GGT TGC CAG ATT ATA 96 
Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie lie 
20 25 30 

CAT CCG CCC TGG GAA GGT GGC ATC AGG TAC CGT GGC TTG ACT CGC GAC 144 
40 His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg Asp 
35 40 45 

CAG GTG AAG GCC ATC AAC TTC CTG CCT GTG GAC TAT GAG ATC GAA TAT 192 

H1865-1 WO SEQ 17 
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Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Giu Tyr 
50 55 60 

GTG TGC CGA GGG GAG CGC GAG GTG GTG GGG CCC AAG GTG CGC AAA TGC 240 
Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys Cys 
65 70 75 80 

CTG GCC AAC GGC TCC TGG ACG GAT ATG GAC ACA CCC AGC CGC TGT GTC 288 
Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys Val 
85 90 95 

CGA ATC TGC TCC AAG TCT TAT TTG ACC CTG GAA AAT GGG AAG GTT TTC 336 
Arg lie Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val Phe 
100 105 110 

CTG ACG GGT GGG GAC CTC CCA GCT CTG GAT GGA GCC CGG GTG GAG TTC 3 84 

Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Glu Phe 
115 120 125 

20 CGA TGT GAC CCC GAC TTC CAT CTG GTG GGC AGC TCC CGG AGC GTC TGT 432 
Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser Arg Ser Val Cys 
130 135 140 

AGT CAG GGC CAG TGG AGC ACC CCC AAG CCC CAC TGC CAG GTG AAT CGA 480 
25 Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys Gin Val Asn Arg 
145 150 155 160 

ACG CCA CAC TCA GAA CGG CGT GCA GTA TAC ATC GGG GCG CTG TTT CCC 528 
Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro 
30 165 170 175 

ATG AGC GGG GGC TGG CCG GGG GGC CAG GCC TGC CAG CCC GCG GTG GAG 576 
Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu 
180 185 190 



35 



40 



ATG GCG CTG GAG GAC GTT AAC AGC CGC AGA GAC ATC CTG CCG GAC TAC 624 
Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp Tyr 
195 200 205 

GAG CTC AAG CTT ATC CAC CAC GAC AGC AAG TGT GAC CCA GGG CAA GCC 672, 
Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala 
210 215 220 
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ACC AAG TAC TTG TAC GAA CTA CTC TAC AAT- GAC CCC ATC AAG ATC ATT 7-2-0 

Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie 
225 230 235 240 

CTC ATG CCT GGC TGT AGT TCT GTC TCC ACA CTT GTA GCT GAG GCT GCC 7 68 

Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala 
245 250 255 

CGG ATG TGG AAC CTT ATT GTG CTC TCA TAT GGC TCC AGT TCA CCA GCC 816 

Arg Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala 
260 265 270 



TTG TCA AAC CGA CAG CGG TTT CCC ACG TTC TTC CGG ACG CAT CCA TCC 864 
Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser 
15 275 280 285 

GCC ACA CTC CAC AAT CCC ACC CGG GTG AAA CTC TTC GAA AAG TGG GGC 912 
Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly 
290 295 300 

20 

TGG AAG AAG ATC GCT ACC ATC CAA CAG ACC ACC GAG GTC TTC ACC TCA 960 
Trp Lys Lys lie Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser 
305 310 315 320 

25 ACG CTG GAT GAC CTG GAG GAG CGA GTG AAA GAG GCT GGG ATC GAG ATC 1008 
Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie 
325 330 335 

ACT TTC CGA CAG AGT TTC TTC TCG GAT CCA GCT GTG CCT GTT AAA AAC 1056 
30 Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn 
340 345 350 

CTG AAG CGT CAA GAT GCT CGA ATC ATC GTG GGA CTT TTC TAT GAG ACG 1104 
Leu Lys Arg Gin Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr 
35 355 360 365 

GAA GCC CGG AAA GTT TTT TGT GAG GTC TAT AAG GAA AGG CTC TTT GGG 1152 
Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly 
370 375 380 



AAG AAG TAC GTC TGG TTC CTC ATC GGG TGG TAT GCT GAC AAC TGG TTC 1200 

Lys Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala Asp Asn* Trp Phe 
385 390 395 400 
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AAG ACC TAT GAC CCG TCA ATC AAT TGT ACA GTG GAA GAA ATG ACC GAG 12? 8 

Lys Thr Tyr Asp Pro Ser lie Asn Cys Thr Val Glu Glu Met Thr Glu 
405 410 415 

GCG GTG GAG GGC CAC ATC ACC ACG GAG ATT GTC ATG CTG AAC CCT GCC 1296 

Ala Val Glu Gly His lie Thr Thr Glu lie Val Met Leu Asn Pro Ala 
420 425 430 

AAC ACC CGA AGC ATT TCC AAC ATG ACG TCA CAG GAA TTT GTG GAG AAA 1344 

Asn Thr Arg Ser lie Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys 

435 440 445 



CTA ACC AAG CGG CTG AAA AGA CAC CCC GAG GAG ACT GGA GGC TTC CAG 1392 
15 Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin 
450 455 460 

GAG GCA CCA CTG GCC TAT GAT GCT ATC TGG GCC TTG GCT TTG GCC TTG 1440 
Glu Ala Pro Leu Ala Tyr Asp Ala lie Trp Ala Leu Ala Leu Ala Leu 
20 465 470 475 480 

AAC AAG ACG TCT GGA GGA GGT GGT CGT TCC GGC GTG CGC CTG GAG GAC 1488 

Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp 

485 490 495 

25 

TTT AAC TAC AAC AAC CAG ACC ATT ACA GAC CAG ATC TAC CGG GCC ATG 153 6 

Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie Tyr Arg Ala Met 

500 505 510 

30 AAC TCC TCC TCC TTT GAG GGC GTT TCT GGC CAT GTG GTC TTT GAT GCC 1584 
Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala 
515 520 525 



AGC GGC TCC CGG ATG GCA 
35 Ser Gly Ser Arg Met Ala 
530 

AGC TAC AAG AAG ATC GGC 
Ser Tyr Lys Lys lie Gly 
40 5 4 5 5 5 0 

TGG TCC AAA ACG GAC AAG 
Trp Ser Lys Thr Asp Lys 



TGG ACA CTT ATC GAG CAG 
Trp Thr Leu He Glu Gin 
535 540 

TAC TAC GAC AGC ACC AAG 
Tyr Tyr Asp Ser Thr Lys 
555 

TGG ATT GGA GGG TCT CCC 
Trp He Gly Gly Ser Pro 



CTA CAG GGC GGC 1632 
Leu Gin Gly Gly 

GAT GAT CTT TCC 1680 
Asp Asp Leu Ser 
560 

CCA GCT GAC CAG 172 8 

Pro Ala Asp Gin 
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565 570 575 ~ 

ACC TTG GTC ATC AAG ACA TTC CGT TTC CTG TCT CAG AAA CTC TTT ATC . 1776 
Thr Leu Val lie Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe lie 
5 580 585 590 

TCC GTC TCA GTT CTC TCC AGC CTG GGC ATT GTT CTT GCT GTT GTC TGT 1824 

Ser Val Ser Val Leu Ser Ser Leu Gly lie Val Leu Ala Val Val Cys 
595 600 605 

10 

CTG TCC TTT AAC ATC TAC AAC TCC CAC GTT CGT TAT ATC CAG AAC TCC 1872 

Leu Ser Phe Asn He Tyr Asn Ser His Val Arg Tyr He Gin Asn Ser 
610 615 620 

15 CAG CCC AAC CTG AAC AAT CTG ACT GCT GTG GGC TGC TCA CTG GCA CTG 1920 
Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu 
625 630 635 640 

GCT GCT GTC TTC CCT CTC GGG CTG GAT GGT TAC CAC ATA GGG AGA AGC 1968 
20 Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His He Gly Arg Ser 

645 650 655 

CAG TTC CCG TTT GTC TGC CAG GCC CGC CTT TGG CTC TTG GGC TTG GGC 2016 
Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly 
25 660 665 670 

TTT AGT CTG GGC TAT GGC TCT ATG TTC ACC AAG ATC TGG TGG GTC CAC 2064 

Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys He Trp Trp Val His 

675 680 685 

30 

ACA GTC TTC ACG AAG AAG GAG GAG AAG AAG GAG TGG AGG AAG ACC CTA 2112 

Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu 
690 695 700 

35 GAG CCC TGG AAA CTC TAT GCC ACT GTG GGC CTG CTG GTG GGC ATG GAT 2160 
Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp 
705 710 715 720 

GTC CTG ACT CTT GCC ATC TGG CAG ATT GTG GAC CCC TTG CAC CGA ACC 2208 
40 Val Leu Thr Leu Ala He Trp Gin He Val Asp Pro Leu His Arg Thr 

725 730 735 

ATT GAG ACT TTT GCC AAG GAG GAA CCA AAG GAA GAC ATC GAT GTC TCC 2256 
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lie Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp lie Asp Val Ser — 
740 745 750 

ATT CTG CCC CAG TTG GAG CAC TGC AGC TCC AAG AAG ATG AAT ACG TGG 2304 

5 lie Leu Pro Gin Leu Glu His Cys Ser Ser Lys Lys Met Asn Thr Trp 
755 760 765 

CTT GGC ATT TTC TAT GGT TAC AAG GGG CTG CTG CTG CTG CTG GGA ATC 2352 

Leu Gly lie Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly lie 
10 770 775 780 

TTT CTT GCT TAC GAA ACC AAG AGC GTG TCC ACT GAA AAG ATC AAT GAC 2400 

Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys lie Asn Asp 

785 790 795 800 

15 

CAC AGG GCC GTG GGC ATG GCT ATC TAC AAT GTC GCG GTC CTG TGT CTC 2448 

His Arg "Ala Val Gly Met Ala lie Tyr Asn Val Ala Val Leu Cys Leu 
805 810 815 

20 ATC ACT GCT CCT GTG ACC ATG ATC CTT TCC AGT CAG CAG GAC GCA GCC 249 6 

lie Thr Ala Pro Val Thr Met He Leu Ser Ser Gin Gin Asp Ala Ala 
820 825 830 

TTT GCC TTT GCC TCT CTG GCC ATC GTG TTC TCT TCC TAC ATC ACT CTG 2544 

25 Phe Ala Phe Ala Ser Leu Ala He Val Phe Ser Ser Tyr He Thr Leu 
835 840 845 

GTT GTG CTC TTT GTG CCC AAG ATG CGC AGG CTG ATC ACC CGA GGG GAA 2592 

Val Val Leu Phe Val Pro Lys Met Arg Arg Leu He Thr Arg Gly Glu 
30 8 5 0 8 5 5 8 6 0 

TGG CAG TCT GAA ACG CAG GAC ACC ATG AAA ACA GGA TCA TCC ACC AAC 2 640 

Trp Gin Ser Glu Thr Gin Asp Thr Met Lys Thr Gly Ser Ser Thr Asn 

865 870 875 880 

35 

AAC AAC GAG GAA GAG AAG TCC CGA CTG TTG GAG AAG GAA AAC CGA GAA 2688 

Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu 
885 890 895 

40 CTG GAA AAG ATC ATC GCT GAG AAA GAG GAG CGC GTC TCT GAA CTG CGC 273 6 

Leu Glu Lys He He Ala Glu Lys Glu Glu Arg Val Ser Glu Leu Arg 
900 905 910 ' 
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CAT CAG CTC CAG TCT CGG 
His Gin Leu Gin Ser Arg 
915 

ACA CCC CCA GAT CCC TCT 
Thr Pro Pro Asp Pro Ser 
930 

CCT GAC CGG CTT AGC TGT 
Pro Asp Arg Leu Ser Cys 
945 950 

TGA 



CAG CAA CTC CGC TCA CGG 
Gin Gin Leu Arg Ser Arg 
920 

GGG GGC CTT CCC AGG GGA 
Gly Gly Leu Pro Arg Gly 
935 940 

GAT GGG AGT CGA GTA CAT 
Asp Gly Ser Arg Val His 
955 



CGC CAC CCC CCA 273*4 

Arg His Pro Pro 

925 

CCC TCT GAG CCC 283 2 

Pro Ser Glu Pro 

TTG CTT TAC AAG 2880 
Leu Leu Tyr Lys 
960 

2883 



(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 960 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45: 

Met Leu Leu Leu Leu Leu Val Pro Leu Phe Leu Arg Pro Leu Gly Ala 
15 10 15 

Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie lie 
20 25 30 

His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg Asp 
35 40 45 

Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu Tyr 
50 55 60 

Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys Cys 
65 70 75 80 

Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys Val 
H1865-1 WO SEQ 23 
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85 90 '95 

Arg lie Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val Phe 
100 105 HO 

Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Glu Phe 
115 120 125 

Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser Arg Ser Val Cys 
130 135 140 

Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys Gin Val Asn Arg 
145 150 155 160 

Thr Pro His Ser Glu Arg Arg Ala Val Tyr He Gly Ala Leu Phe Pro 
165 170 175 

Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu 
180 185 190 

Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp He Leu Pro Asp Tyr 
195 200 205 

Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala 
210 215 220 

Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro He Lys lie He 
225 230 235 240 

Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala 
245 250 255 

Arg Met Trp Asn Leu He Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala 
260 265 270 

Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser 
275 280 285 

Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly 
290 295 300 

Trp Lys Lys He Ala Thr He Gin Gin Thr Thr Glu Val Phe Thr Ser 
305 310 315 320 
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Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly He Glu He 
325 330 335 

5 Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn 
340 345 350 

Leu Lys Arg Gin Asp Ala Arg He He Val Gly Leu Phe Tyr Glu Thr 
355 360 365 

10 

Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly 
370 375 380 

Lys Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp Phe 
15 385 390 395 400 

Lys Thr Tyr Asp Pro Ser He Asn Cys Thr Val Glu Glu Met Thr Glu 
405 410 415 

20 Ala Val Glu Gly His He Thr Thr Glu He Val Met Leu Asn Pro Ala 
420 425 430 

Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys 
435 440 445 

25 

Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin 
450 455 460 

Glu Ala Pro Leu Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala Leu 
30 465 470 475 480 

Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp 
485 490 495 

35 Phe Asn Tyr Asn Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala Met 
500 505 510 

Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala 
515 520 525 

40 

Ser Gly Ser Arg Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly Gly 
530 535 540 
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Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser 
545 550 555 560 

Trp Ser Lys Thr Asp Lys Trp lie Gly Gly Ser Pro Pro Ala Asp Gin 
5 565 570 575 

Thr Leu Val He Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe He 
580 585 590 

10 Ser Val Ser Val Leu Ser Ser Leu Gly He Val Leu Ala Val Val Cys 
595 600 605 

Leu Ser Phe Asn He Tyr Asn Ser His Val Arg Tyr He Gin Asn Ser 
610 615 620 

15 

Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu 
625 630 635 640 

Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His He Gly Arg Ser 
20 645 650 655 

Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly 
660 665 670 

25 Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys He Trp Trp Val His 
675 680 685 

Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu 
690 695 700 

30 

Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp 
705 710 715 720 

Val Leu Thr Leu Ala He Trp Gin He Val Asp Pro Leu His Arg Thr 
35 725 730 735 

He Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp He Asp Val Ser 
740 745 750 

40 He Leu Pro Gin Leu Glu His Cys Ser Ser Lys Lys Met Asn Thr Trp 
755 760 765 

Leu Gly He Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly He 
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770 775 780 

Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys He Asn Asp 
785 790 795 800 

His Arg Ala Val Gly Met Ala lie Tyr Asn Val Ala Val Leu Cys Leu 
805 810 815 

He Thr Ala Pro Val Thr Met He Leu Ser Ser Gin Gin Asp Ala Ala 
820 825 830 

Phe Ala Phe Ala Ser Leu Ala He Val Phe Ser Ser Tyr He Thr Leu 
835 840 845 

Val Val Leu Phe Val Pro Lys Met Arg Arg Leu He Thr Arg Gly Glu 
850 855 860 

Trp Gin Ser Glu Thr Gin Asp Thr Met Lys Thr Gly Ser Ser Thr Asn 
865 870 875 880 

Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu 
885 890 895 

Leu Glu Lys He He Ala Glu Lys Glu. Glu Arg Val Ser Glu Leu Arg 
900 905 910 

His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro Pro 
915 920 925 

Thr Pro Pro Asp Pro Ser Gly Gly Leu Pro Arg Gly Pro Ser Glu Pro 
930 935 940 

Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr Lys 
945 950 955 960 



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 8 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D) TOPOLOGY : 1 inear 
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(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Rattus norwegicus 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION :1. .2532 

(x) PUBLICATION INFORMATION: 
(A) AUTHORS: Kaupmann, K 



Huggel , K 
Heid, J 
Flor, P 
Bischoff, M 
Mickel, S 
McMaster, G 
Angst, C 
Bittiger, H 
Froestl, W 



(B) TITLE: Expression cloning of GABA-B receptors 

uncovers similarity to metabotropic glutamate 
receptors 

(C) JOURNAL: Nature 
<D) VOLUME: 3 86 

(F) PAGES: 239-246 

(G) DATE: 20 March-1997 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

ATG GGC CCG GGG GGA CCC TGT ACC CCA GTG GGG TGG CCG CTG CCT CTT 
Met Gly Pro Gly Gly Pro Cys Thr Pro Val Gly Trp Pro Leu Pro Leu 



CTG CTG GTG ATG GCG GCT GGG GTG GCT CCG GTG TGG GCC TCT CAC TCC 
Leu Leu Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 



1 



5 



10 



15 



20 



25 



30 
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CCT CAT CTC CCG CGG CCT CAC CCG AGG GTC CCC CCG CAC CCC TCC TCA 
Pro His Leu Pro Arg Pro His Pro Arg Val Pro Pro His Pro Ser Ser 
35 40 45 

5 GAA CGG CGT GCA GTA TAC ATC GGG GCG CTG TTT CCC ATG AGC GGG GGC 192 
Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 

TGG CCG GGG GGC CAG GCC TGC CAG CCC GCG GTG GAG ATG GCG CTG GAG 240 
10 Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
65 70 75 80 

GAC GTT AAC AGC CGC AGA GAC ATC CTG CCG GAC TAC GAG CTC AAG CTT 288 
Asp Val Asn Ser Arg Arg Asp He Leu Pro Asp Tyr Glu Leu Lys Leu 
15 85 90 95 

ATC CAC CAC GAC AGC AAG TGT GAC CCA GGG CAA GCC ACC AAG TAC TTG 336 

He His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 

100 105 110 

20 

TAC GAA CTA CTC TAC AAT GAC CCC ATC AAG ATC ATT CTC ATG CCT GGC 3 84 

Tyr Glu Leu Leu Tyr Asn Asp Pro He Lys He He Leu Met Pro Gly 

115 120 125 

25 TGT AGT TCT GTC TCC AC A CTT GTA GCT GAG GCT GCC CGG ATG TGG AAC 432 
Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
130 135 140 



30 



35 



40 



CTT ATT GTG CTC TCA TAT GGC TCC AGT TCA CCA GCC TTG TCA AAC CGA 480 
Leu He Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
145 150 155 160 

CAG CGG TTT CCC ACG TTC TTC CGG ACG CAT CCA TCC GCC ACA CTC CAC 52 8 

Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His 
165 170 175 

AAT CCC ACC CGG GTG AAA CTC TTC GAA AAG TGG GGC TGG AAG AAG ATC 576 
Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys He 
180 185 190 

GCT ACC ATC CAA CAG ACC ACC GAG GTC TTC ACC TCA ACG CTG GAT GAC 624 
Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
195 200 205 
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CTG GAG GAG CGA GTG AAA GAG GCT GGG ATC GAG ATC ACT TTC CGA CAG 672 
Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie Thr Phe Arg Gin 
210 215 220 

5 

AGT TTC TTC TCG GAT CCA GCT GTG CCT GTT AAA AAC CTG AAG CGT CAA 720 
Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
225 230 235 240 

10 GAT GCT CGA ATC ATC GTG GGA CTT TTC TAT GAG ACG GAA GCC CGG AAA 76 8 

Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
245 250 255 

GTT TTT TGT GAG GTC TAT AAG GAA AGG CTC TTT GGG AAG AAG TAC GTC 816 
15 Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
260 265 270 

TGG TTC CTC ATC GGG TGG TAT GCT GAC AAC TGG TTC AAG ACC TAT GAC 864 
Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys Thr Tyr Asp 
20 275 280 285 

CCG TCA ATC AAT TGT ACA GTG GAA GAA ATG ACC GAG GCG GTG GAG GGC 912 

Pro Ser lie Asn Cys Thr Val Glu Glu Met Thr Glu Ala Val Glu Gly 

290 295 300 

25 

CAC ATC ACC ACG GAG ATT GTC ATG CTG AAC CCT GCC AAC ACC CGA AGC 960 

His lie Thr Thr Glu lie Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
305 310 315 320 

30 ATT TCC AAC ATG ACG TCA CAG GAA TTT GTG GAG AAA CTA ACC AAG CGG 1008 
lie Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
325 330 335 

CTG AAA AGA CAC CCC GAG GAG ACT GGA GGC TTC CAG GAG GCA CCA CTG 1056 
35 Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
340 345 350 

GCC TAT GAT GCT ATC TGG GCC TTG GCT TTG GCC TTG AAC AAG ACG TCT 1104 
Ala Tyr Asp Ala lie Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 
40 355 360 365 

GGA GGA GGT GGT CGT TCC GGC GTG CGC CTG GAG GAC TTT AAC TAC AAC 1152 
Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
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370 375 380 

AAC CAG ACC ATT ACA GAC CAG ATC TAC CGG GCC ATG AAC TCC TCC TCC 1200 
Asn Gin Thr lie Thr Asp Gin He Tyr Arg Ala Met Asn Ser Ser Ser 
385 390 395 400 

TTT GAG GGC GTT TCT GGC CAT GTG GTC TTT GAT GCC AGC GGC TCC CGG 1248 
Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
405 410 415 

ATG GCA TGG ACA CTT ATC GAG CAG CTA CAG GGC GGC AGC TAC AAG AAG 1296 
Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
420 425 430 

ATC GGC TAC TAC GAC AGC ACC AAG GAT GAT CTT TCC TGG TCC AAA ACG 13 44 

He Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 

GAC AAG TGG ATT GGA GGG TCT CCC CCA GCT GAC CAG ACC TTG GTC ATC 13 92 

Asp Lys Trp He Gly Gly Ser Pro Pro Ala Asp Gin Thr Leu Val He 
450 455 460 

AAG ACA TTC CGT TTC CTG TCT CAG AAA CTC TTT ATC TCC GTC TCA GTT 1440 
Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe He Ser Val Ser Val 
465 470 475 480 

CTC TCC AGC CTG GGC ATT GTT CTT GCT GTT GTC TGT CTG TCC TTT AAC 1488 
Leu Ser Ser Leu Gly He Val Leu Ala Val Val Cys Leu Ser Phe Asn 
485 490 495 

ATC TAC AAC TCC CAC GTT CGT TAT ATC CAG AAC TCC CAG CCC AAC CTG 153 6 

lie Tyr Asn Ser His Val Arg Tyr He Gin Asn Ser Gin Pro Asn Leu 
500 505 510 

AAC AAT CTG ACT GCT GTG GGC TGC TCA CTG GCA CTG GCT GCT GTC TTC 1584 
Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
515 520 525 

CCT CTC GGG CTG GAT GGT TAC CAC ATA GGG AGA AGC CAG TTC CCG TTT 1632 
Pro Leu Gly Leu Asp Gly Tyr His He Gly Arg Ser Gin Phe Pro Phe 
530 535 540 

GTC TGC CAG GCC CGC CTT TGG CTC TTG GGC TTG GGC TTT AGT CTG GGC 1680 
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Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly 
545 550 555 560 

TAT GGC TCT ATG TTC ACC AAG ATC TGG TGG GTC CAC AC A GTC TTC ACG 1728 
Tyr Gly Ser Met Phe Thr Lys lie Trp Trp Val His Thr Val Phe Thr 
565 570 575 

AAG AAG GAG GAG AAG AAG GAG TGG AGG AAG ACC CTA GAG CCC TGG AAA 1776 
Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys 
580 585 590 

CTC TAT GCC ACT GTG GGC CTG CTG GTG GGC ATG GAT GTC CTG ACT CTT 1824 
Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
595 600 605 

GCC ATC TGG CAG ATT GTG GAC CCC TTG CAC CGA ACC ATT GAG ACT TTT 1872 
Ala He Trp Gin He Val Asp Pro Leu His Arg Thr lie Glu Thr Phe 
610 615 620 

20 GCC AAG GAG GAA CCA AAG GAA GAC ATC GAT GTC TCC ATT CTG CCC CAG 1920 
Ala Lys Glu Glu Pro Lys Glu Asp He Asp Val Ser lie Leu Pro Gin 
625 630 635 640 

TTG GAG CAC TGC AGC TCC AAG AAG ATG AAT ACG TGG CTT GGC ATT TTC 1968 
25 Leu Glu His Cys Ser Ser Lys Lys Met Asn Thr Trp Leu Gly He Phe 

645 650 655 

TAT GGT TAC AAG GGG CTG CTG CTG CTG CTG GGA ATC TTT CTT GCT TAC 2016 
Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly He Phe Leu Ala Tyr 
30 6 6 0 6 6 5 6 7 0 

GAA ACC AAG AGC GTG TCC ACT GAA AAG ATC AAT GAC CAC AGG GCC GTG 2064 
Glu Thr Lys Ser Val Ser Thr Glu Lys He Asn Asp His Arg Ala Val 
675 680 685 



35 



40 



GGC ATG GCT ATC TAC AAT GTC GCG GTC CTG TGT CTC ATC ACT GCT CCT 2112 
Gly Met Ala He Tyr Asn Val Ala Val Leu Cys Leu He Thr Ala Pro 
690 695 700 

GTG ACC ATG ATC CTT TCC AGT CAG CAG GAC GCA GCC TTT GCC TTT GCC 2160 
Val Thr Met He Leu Ser Ser Gin Gin Asp Ala Ala Phe Ala Phe Ala 
705 710 715 * 720 
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TCT CTG GCC ATC 
Ser Leu Ala. lie 

5 GTG CCC AAG ATG 
Val Pro Lys Met 
740 

ACG CAG GAC ACC 
10 Thr Gin Asp Thr 
755 

GAG AAG TCC CGA 
Glu Lys Ser Arg 
15 770 

ATC GCT GAG AAA 
lie Ala Glu Lys 
785 

20 

TCT CGG CAG CAA 
Ser Arg Gin Gin 

25 CCC TCT GGG GGC 
Pro Ser Gly Gly 
820 

AGC TGT GAT GGG 
30 Ser Cys Asp Gly 
835 



GTG TTC TCT TCC TAC 
Val Phe Ser Ser Tyr 
725 

CGC AGG CTG ATC ACC 
Arg Arg Leu He Thr 
745 

ATG AAA ACA GGA TCA 
Met Lys Thr Gly Ser 
760 

CTG TTG GAG AAG GAA 
Leu Leu Glu Lys Glu 
775 

GAG GAG CGC GTC TCT 
Glu Glu Arg Val Ser 
790 

CTC CGC TCA CGG CGC 
Leu Arg Ser Arg Arg 
805 

CTT CCC AGG GGA CCC 
Leu Pro Arg Gly Pro 
825 

AGT CGA GTA CAT TTG 
Ser Arg Val His Leu 
840 



ATC ACT CTG GTT GTG 
lie Thr Leu Val Val 
730 

CGA GGG GAA TGG CAG 
Arg Gly Glu Trp Gin 
750 

TCC ACC AAC AAC AAC 
Ser Thr Asn Asn Asn 
765 

AAC CGA GAA CTG GAA 
Asn Arg Glu Leu Glu 
780 

GAA CTG CGC CAT CAG 
Glu Leu Arg His Gin 
795 

CAC CCC CCA ACA CCC 
His Pro Pro Thr Pro 
810 

TCT GAG CCC CCT GAC 
Ser Glu Pro Pro Asp 
830 

CTT TAC AAG TGA 
Leu Tyr Lys * 



CTC TTT 22^8 

Leu Phe 

735 

TCT GAA 225 6 

Ser Glu 

GAG GAA 2304 
Glu Glu 

AAG ATC 23 52 

Lys He 

CTC CAG 2400 
Leu Gin 
800 

CCA GAT 2448 

Pro Asp 

815 

CGG CTT 2496 
Arg Leu 

2535 



(2) INFORMATION FOR SEQ ID NO: 47: 

35 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 844 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

40 

<ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47: 
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Met Gly Pro Gly Gly Pro Cys Thr Pro Val Gly Trp Pro Leu Pro Leu 
15 10 15 

Leu Leu Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 
20 25 30 

Pro His Leu Pro Arg Pro His Pro Arg Val Pro Pro His Pro Ser Ser 
35 40 45 



Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 

Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
65 70 75 80 

Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu 
85 90 95 



lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
100 105 110 

Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu Met Pro Gly 
115 120 125 

Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
130 135 140 

Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
145 150 155 160 

Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His 
165 170 175 

Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie 
180 185 190 

Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
195 200 205 

Leu Glu Glu Arg Val Lys Glu Ala Gly He Glu He Thr Phe Arg Gin 
210 215 220 
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Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin ~ 
225 230 235 240 

Asp Ala Arg He He Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
245 250 255 

Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
260 265 270 

Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp Phe Lys Thr Tyr Asp 
275 280 285 

Pro Ser He Asn Cys Thr Val Glu Glu Met Thr Glu Ala Val Glu Gly 
290 295 300 

His He Thr Thr Glu He Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
305 310 315 320 

He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
325 330 335 

Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
340 345 350 

Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 
355 360 365 

Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
370 375 380 

Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala Met Asn Ser Ser Ser 
385 390 395 400 

Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
405 410 415 

Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
420 425 430 

He Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 

Asp Lys Trp He Gly Gly Ser Pro Pro Ala Asp Gin Thr Leu Val He 
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450 455 460 

Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe lie Ser Val Ser Val 
■ 465 470 475 480 

5 

Leu Ser Ser Leu Gly lie Val Leu Ala Val Val Cys Leu Ser Phe Asn 
485 490 495 

lie Tyr Asn Ser His Val Arg Tyr lie Gin Asn Ser Gin Pro Asn Leu 
10 500 505 510 

Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
515 520 525 

15 Pro Leu Gly Leu Asp Gly Tyr His lie Gly Arg Ser Gin Phe Pro Phe 
530 535 540 

Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly 
545 550 555 560 

20 

Tyr Gly Ser Met Phe Thr Lys lie Trp Trp Val His Thr Val Phe Thr 
565 570 575 

Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys 
25 580 585 590 

Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
595 600 605 

30 Ala lie Trp Gin lie Val Asp Pro Leu His Arg Thr lie Glu Thr Phe 
610 615 620 

Ala Lys Glu Glu Pro Lys Glu Asp lie Asp Val Ser lie Leu Pro Gin 
625 630 635 640 

35 

Leu Glu His Cys Ser Ser Lys Lys Met Asn Thr Trp Leu Gly lie Phe 
645 650 655 

Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly lie Phe Leu Ala Tyr 
40 660 665 670 

Glu Thr Lys Ser Val Ser Thr Glu Lys lie Asn Asp His Arg Ala Val 
675 680 685 
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Gly Met Ala lie 
690 

Val Thr Met lie 
705 

Ser Leu Ala lie 



Val Pro Lys Met 
740 

Thr Gin Asp Thr 
755 

Glu Lys Ser Arg 
770 

lie Ala Glu Lys 
785 

Ser Arg Gin Gin 



Pro Ser Gly Gly 
820 

Ser Cys Asp Gly 
835 



Tyr Asn Val Ala 
695 

Leu Ser Ser Gin 
710 

Val Phe Ser Ser 
725 

Arg Arg Leu lie 



Met Lys Thr Gly 
760 

Leu Leu Glu Lys 
775 

Glu Glu Arg Val 
790 

Leu Arg Ser Arg 
805 

Leu Pro Arg Gly 



Ser Arg Val His 
840 



Val Leu Cys Leu 
700 

Gin Asp Ala Ala 
715 

Tyr lie Thr Leu 
730 

Thr Arg Gly Glu 
745 

Ser Ser Thr Asn 



Glu Asn Arg Glu 
780 

Ser Glu Leu Arg 
795 

Arg His Pro Pro 
810 

Pro Ser Glu Pro 
825 

Leu Leu Tyr Lys 



lie Thr Ala Pro 



Phe Ala Phe Ala 
720 

Val Val Leu Phe 
735 

Trp Gin Ser Glu 
750 

Asn Asn Glu Glu 
765 

Leu Glu Lys lie 



His Gin Leu Gin 
800 

Thr Pro Pro Asp 
815 

Pro Asp Arg Leu 
830 



(2). INFORMATION FOR SEQ ID NO : 48: 

(i) SEQUENCE CHARACTERISTICS; 

(A) LENGTH: 2886 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO 



(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION : 1 . .2886 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 



ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 1 5 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

20 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

25 GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 
30 Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 288 
Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
35 85 90 95. 

GTC CGA ATC TGC TCC AAG TCT TAT TTG ACC CTG GAA AAT GGG AAG GTT 33 6 

Val Arg lie Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val 
100 105 110 



TTC CTG ACG GGT GGG GAC CTC CCA GCT CTG GAC GGA GCC CGG GTG GAT 3 84 

Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Asp 

115 120 125 
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TTC CGG TGT GAC CCC GAC TTC CAT CTG GTG GGC AGC TCC CGG AGC ATC 432 

Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser Arg Ser lie 

130 135 140 

5 

TGT AGT CAG GGC CAG TGG AGC ACC CCC AAG CCC CAC TGC CAG GTG AAT 480 

Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys Gin Val Asn 
145 150 155 160 

10 CGA ACG CCA CAC TCA GAA CGG CGC GCA GTG TAC ATC GGG GCA CTG TTT 528 
Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe 
165 170 175 

CCC ATG AGC GGG GGC TGG CCA GGG GGC CAG GCC TGC CAG CCC GCG GTG 576 
15 Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val 
180 185 190 

GAG ATG GCG CTG GAG GAC GTG AAT AGC CGC AGG GAC ATC CTG CCG GAC 624 
Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp 
20 195 200 205 

TAT GAG CTC AAG CTC ATC CAC CAC GAC AGC AAG TGT GAT CCA GGC CAA 672 
Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp Pro Gly Gin 
210 215 220 

25 

GCC ACC AAG TAC CTA TAT GAG CTG CTC TAC AAC GAC CCT ATC AAG ATC 720 
Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie 
225 230 235 240 

30 ATC CTT ATG CCT GGC TGC AGC TCT GTC TCC ACG CTG GTG GCT GAG GCT 768 
lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala 
245 250 255 

GCT AGG ATG TGG AAC CTC ATT GTG CTT TCC TAT GGC TCC AGC TCA CCA 816 
35 Ala Arg Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro 
260 265 270 

GCC CTG TCA AAC CGG CAG CGT TTC CCC ACT TTC TTC CGA ACG CAC CCA 864 
Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro 
40 275 280 285 

TCA GCC ACA CTC CAC AAC CCT ACC CGC GTG AAA CTC TTT GAA AAG TGG 912 
Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp 
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290 295 300 

GGC TGG AAG AAG ATT GCT ACC ATC CAG CAG ACC ACT GAG GTC TTC ACT 960 
Gly Trp Lys Lys lie Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr 
305 310 315 320 

TCG ACT CTG GAC GAC CTG GAG GAA CGA GTG AAG GAG GCT GGA ATT GAG 1008 
Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu 
325 330 335 

ATT ACT TTC CGC CAG AGT TTC TTC TCA GAT CCA GCT GTG CCC GTC AAA 1056 
He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys 
340 345 350 

15 AAC CTG AAG CGC CAG GAT GCC CGA ATC ATC GTG GGA CTT TTC TAT GAG 1104 
Asn Leu Lys Arg Gin Asp Ala Arg He He Val Gly Leu Phe Tyr Glu 
355 360 365 

ACT GAA GCC CGG AAA GTT TTT TGT GAG GTG TAC AAG GAG CGT CTC TTT 1152 
20 Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe 
370 375 380 

GGG AAG AAG TAC GTC TGG TTC CTC ATT GGG TGG TAT GCT GAC AAT TGG 1200 
Gly Lys Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp 
25 385 390 395 400 

TTC AAG ATC TAC GAC CCT TCT ATC AAC TGC AC A GTG GAT GAG ATG ACT 1248 

Phe Lys He Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu Met Thr 
405 410 415 

30 

GAG GCG GTG GAG GGC CAC ATC ACA ACT GAG ATT GTC ATG CTG AAT CCT 1296 

Glu Ala Val Glu Gly His He Thr Thr Glu He Val Met Leu Asn Pro 
420 425 430 

35 GCC AAT ACC CGC AGC ATT TCC AAC ATG ACA TCC CAG GAA TTT GTG GAG 1344 
Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe Val Glu 
435 440 445 

AAA CTA ACC AAG CGA CTG AAA AGA CAC CCT GAG GAG ACA GGA GGC TTC 1392 
40 Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe 
450 455 460 

CAG GAG GCA CCG CTG GCC TAT GAT GCC ATC TGG GCC TTG GCA CTG GCC^ 1440 
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Gin Glu Ala Pro Leu Ala 
465 470 

CTG AAC AAG ACA TCT GGA 
5 Leu Asn Lys Thr Ser Gly 

485 

GAC TTC AAC TAC AAC AAC 
Asp Phe Asn Tyr Asn Asn 
10 500 

ATG AAC TCT TCG TCC TTT 
Met Asn Ser Ser Ser Phe 
515 

15 

GCC AGC GGC TCT CGG ATG 
Ala Ser Gly Ser Arg Met 
530 



Tyr Asp Ala lie Trp Ala 
475 

GGA GGC GGC CGT TCT GGT 
Gly Gly Gly Arg Ser Gly 
490 

CAG ACC ATT ACC GAC CAA 
Gin Thr lie Thr Asp Gin 
5.05 

GAG GGT GTC TCT GGC CAT 
Glu Gly Val Ser Gly His 
520 

GCA TGG ACG CTT ATC GAG 
Ala Trp Thr Leu lie Glu 
535 540 



PCT/SE98/01947 

Leu Ala Leu Ala 
480 

GTG CGC CTG GAG 148 8 

Val Arg Leu Glu 
495 

ATC TAC CGG GCA 153 6 

lie Tyr Arg Ala 
510 

GTG GTG TTT GAT 1584 

Val Val Phe Asp 

525 

CAG CTT CAG GGT 1632 
Gin Leu Gin Gly 



20 GGC AGC TAC AAG AAG ATT GGC TAC TAT GAC AGC ACC AAG GAT GAT CTT 1680 

Gly Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu 
545 550 555 560 

TCC TGG TCC AAA ACA GAT AAA TGG ATT GGA GGG TCC CCC CCA GCT GAC 1728 

25 Ser Trp Ser Lys Thr Asp Lys Trp lie Gly Gly Ser Pro Pro Ala Asp 

565 570 575 



CAG ACC CTG GTC ATC AAG ACA TTC CGC TTC CTG TCA CAG AAA CTC TTT 1776 

Gin Thr Leu Val lie Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe 
30 5 8 0 5 8 5 5 9 0 

ATC TCC GTC TCA GTT CTC TCC AGC CTG GGC ATT GTC CTA GCT GTT GTC 1824 

lie Ser Val Ser Val Leu Ser Ser Leu Gly lie Val Leu Ala Val Val 
595 600 605 

35 

TGT CTG TCC TTT AAC ATC TAC AAC TCA CAT GTC CGT TAT ATC CAG AAC 1872 

Cys Leu Ser Phe Asn lie Tyr Asn Ser His Val Arg Tyr lie Gin Asn 

610 615 620 

40 TCA CAG CCC AAC CTG AAC AAC CTG ACT GCT GTG GGC TGC TCA CTG GCT 1920*., 

Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala 
625 630 635 640 
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TTA GCT GCT GTC TTC CCC CTG GGG CTC GAT GGT TAG CAC ATT GGG AGG 19"6-8 
Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His lie Gly Arg 
645 650 655 

5 AAC CAG TTT CCT TTC GTC TGC CAG GCC CGC CTC TGG CTC CTG GGC CTG 2016 
Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu 
660 665 670 

GGC TTT AGT CTG GGC TAC GGT TCC ATG TTC ACC AAG ATT TGG TGG GTC 2064 
10 Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys lie Trp Trp Val 
675 680 685 

CAC ACG GTC TTC AC A AAG AAG GAA GAA AAG AAG GAG TGG AGG AAG ACT 2112 
His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr 
15 690 695 700 

CTG GAA CCC TGG AAG CTG TAT GCC ACA GTG GGC CTG CTG GTG GGC ATG 2160 

Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met 

705 710 715 720 

20 

GAT GTC CTC ACT CTC GCC ATC TGG CAG ATC GTG GAC CCT CTG CAC CGG 2208 

Asp Val Leu Thr Leu Ala lie Trp Gin lie Val Asp Pro Leu His Arg 
725 730 735 

25 ACC ATT GAG ACA TTT GCC AAG GAG GAA CCT AAG GAA GAT ATT GAC GTC 2256 
Thr lie Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp lie Asp Val 
740 745 750 

TCT ATT CTG CCC CAG CTG GAG CAT TGC AGC TCC AGG AAG ATG AAT ACA 23 04 

30 Ser lie Leu Pro Gin Leu Glu His Cys Ser Ser Arg Lys Met Asn Thr 
755 760 765 

TGG CTT GGC ATT TTC TAT GGT TAC AAG GGG CTG CTG CTG CTG CTG GGA 2352 
Trp Leu Gly lie Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly 
35 770 775 780 

ATC TTC CTT GCT TAT GAG ACC AAG AGT GTG TCC ACT GAG AAG ATC AAT 2400 
lie Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys lie Asn 
785 790 795 800 



40 



GAT CAC CGG GCT GTG GGC ATG GCT ATC TAC AAT GTG GCA GTC CTG TGC 2448 

Asp His Arg Ala Val Gly Met Ala lie Tyr Asn Val Ala Val Leu Cys 
805 810 815 
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CTC ATC ACT GCT CCT GTC ACC ATG ATT CTG TCC AGC CAG CAG GAT GCA 2496 
Leu He Thr Ala Pro Val Thr Met He Leu Ser Ser Gin Gin Asp Ala 
820 825 830 

5 

GCC TTT GCC TTT GCC TCT CTT GCC ATA GTT TTC TCC TCC TAT ATC ACT 2544 
Ala Phe Ala Phe Ala Ser Leu Ala He Val Phe Ser Ser Tyr He Thr 
835 840 845 

10 CTT GTT GTG CTC TTT GTG CCC AAG ATG CGC AGG CTG ATC ACC CGA GGG 2592 
Leu Val Val Leu Phe Val Pro Lys" Met Arg Arg Leu He Thr Arg Gly 
850 855 860 

GAA TGG CAG TCG GAG GCG CAG GAC ACC ATG AAG ACA GGG TCA TCG ACC 2640 
15 Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly Ser Ser Thr 
865 870 875 880 

AAC AAC AAC GAG GAG GAG AAG TCC CGG CTG TTG GAG AAG GAG AAC CGT 2688 
Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg 
20 885 890 895 

GAA CTG GAA AAG ATC ATT GCT GAG AAA GAG GAG CGT GTC TCT GAA CTG 273 6 

Glu Leu Glu Lys He He Ala Glu Lys Glu Glu Arg Val Ser Glu Leu 
900 905 910 

25 

CGC CAT CAA CTC CAG TCT CGG CAG CAG CTC CGC TCC CGG CGC CAC CCA 2784 
Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro 
915 920 925 

30 CCG ACA CCC CCA GAA CCC TCT GGG GGC CTG CCC AGG GGA CCC CCT GAG 2832 
Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro Arg Gly Pro Pro Glu 
930 935 940 

CCC CCC GAC CGG CTT AGC TGT GAT GGG AGT CGA GTG CAT TTG CTT TAT 2880 
35 Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr 
945 950 955 960 

AAG TGA 2886 
Lys * 

40 



(2) INFORMATION FOR SEQ ID NO: 49: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 961 amino acids 

(B) TYPE: amino acid 
5 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49: 

10 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
15 20 25 30 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

20 Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Glu 
50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

25 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

Val Arg He Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val 
30 100 105 HO 

Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Asp 
115 120 125 

35 Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser Arg Ser He 
130 135 140 

Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys Gin Val Asn 
145 150 155 160 

40 

Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr He Gly Ala Leu Phe 
165 170 ' 175 
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Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val 
180 185 190 

Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp 
5 195 200 205 

Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp Pro Gly Gin 
210 215 220 

10 Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie 
225 230 235 240 

lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala 
245 250 255 

15 

Ala Arg Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro 
260 265 270 

Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro 
20 275 280 285 

Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp 
290 295 300 

25 Gly Trp Lys Lys lie Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr 
305 310 315 320 

Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu 
325 330 335 

30 

lie Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys 
340 345 350 

Asn Leu Lys Arg Gin Asp Ala Arg He He Val Gly Leu Phe Tyr Glu 
35 355 360 365 

Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe 
370 375 380 

40 Gly Lys Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp 
385 390 395 400 

Phe Lys He Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu Met Thr 
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405 410 415 

Glu Ala Val Glu Gly His lie Thr Thr Glu lie Val Met Leu Asn Pro 
420 425 430 

5 

Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe Val Glu 
435 440 445 

Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe 
10 450 455 460 

Gin Glu Ala Pro Leu Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala 
465 470 475 480 

15 Leu Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu 

485 490 495 

Asp Phe Asn Tyr Asn Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala 
500 505 510 

20 

Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val Phe Asp 
515 520 525 

Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly 
25 530 535 540 

Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu 
545 550 555 560 

30 Ser Trp Ser Lys Thr Asp Lys Trp He Gly Gly Ser Pro Pro Ala Asp 

565 570 575 

Gin Thr Leu Val He Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe 
580 585 590 

35 

He Ser Val Ser Val Leu Ser Ser Leu Gly He Val Leu Ala Val Val 
595 600 605 

Cys Leu Ser Phe Asn He Tyr Asn Ser His Val Arg Tyr He Gin Asn 
40 610 615 620 

Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala 
625 630 635 640 
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Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His lie Gly Arg 
645 650 655 

5 Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu 
660 665 670 

Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys He Trp Trp Val 
675 680 685 

10 

His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr 
690 695 700 

Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met 
15 705 710 715 720 

Asp Val Leu Thr Leu Ala He Trp Gin He Val Asp Pro Leu His Arg 
725 730 735 

20 Thr He Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp He Asp Val 
740 745 750 

Ser He Leu Pro Gin Leu Glu His Cys Ser Ser Arg Lys Met Asn Thr 
755 760 765 

25 

Trp Leu Gly He Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly 
770 775 780 

He Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys He Asn 
30 785 790 795 800 

Asp His Arg Ala Val Gly Met Ala He Tyr Asn Val Ala Val Leu Cys 
805 810 815 

35 Leu He Thr Ala Pro Val Thr Met He Leu Ser Ser Gin Gin Asp Ala 
820 825 830 

Ala Phe Ala Phe Ala Ser Leu Ala He Val Phe Ser Ser Tyr He Thr 
835 840 845 

40 

Leu Val Val Leu Phe Val Pro Lys Met Arg Arg Leu He Thr Arg Gly 
850 855 860 
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Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Giy Ser Ser Thr 
865 870 875 880 

Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg 
5 885 890 895 

Glu Leu Glu Lys lie lie Ala Glu Lys Glu Glu Arg Val Ser Glu Leu 
900 905 910 

10 Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro 

915 920 925 

Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro Arg Gly Pro Pro Glu 
930 935 940 



15 



20 



Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr 
945 950 955 960 



Lys 



(2) INFORMATION FOR SEQ ID NO: 50: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 253 5 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

(D) TOPOLOGY: linear 

30 

(ii) MOLECULE TYPE: cDNA to rRNA 

(iii) HYPOTHETICAL: NO 

35 (iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

40 (ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1 . .2535 
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<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: — 

ATG GGG CCC GGG GCC CCT TTT GCC CGG GTG GGG TGG CCA CTG CCG CTT 48 
Met Gly Pro Gly Ala Pro Phe Ala Arg Val Gly Trp Pro Leu Pro Leu 
1 5 10 15 

CTG GTT GTG ATG GCG GCA GGG GTG GCT CCG GTG TGG GCC TCC CAC TCC 9 6 

Leu Val Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 
20 25 30 

CCC CAT CTC CCG CGG CCT CAC TCG CGG GTC CCC CCG CAC CCC TCC TCA 144 
Pro His Leu Pro Arg Pro His Ser Arg Val Pro Pro His Pro Ser Ser 
35 40 45 

15 GAA CGG CGC GCA GTG TAC ATC GGG GCA CTG TTT CCC ATG AGC GGG GGC 192 
Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 

TGG CCA GGG GGC CAG GCC TGC CAG CCC GCG GTG GAG ATG GCG CTG GAG 240 
20 Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
65 70 75 80 

GAC GTG AAT AGC CGC AGG GAC ATC CTG CCG GAC TAT GAG CTC AAG CTC 288 
Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu 
25 85 90 95 

ATC CAC CAC GAC AGC AAG TGT GAT CCA GGC CAA GCC ACC AAG TAC CTA 336 
lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
100 105 110 



30 



35 



40 



TAT GAG CTG CTC TAC AAC GAC CCT ATC AAG ATC ATC CTT ATG CCT GGC 3 84 

Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu Met Pro Gly 
115 120 125 

TGC AGC TCT GTC TCC ACG CTG GTG GCT GAG GCT GCT AGG ATG TGG AAC 432 
Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
130 135 140 

CTC ATT GTG CTT TCC TAT GGC TCC AGC TCA CCA GCC CTG TCA AAC CGG 480 
Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg u 
145 150 155 ^ 160 

CAG CGT TTC CCC ACT TTC TTC CGA ACG CAC CCA TCA GCC ACA CTC CAC 528 
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Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His — 
165 170 175 

AAC CCT ACC CGC GTG AAA CTC TTT GAA AAG TGG GGC TGG AAG AAG ATT 576 
5 Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie 
180 185 190 

GCT ACC ATC CAG CAG ACC ACT GAG GTC TTC ACT TCG ACT CTG GAC GAC 624 
Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
10 195 200 205 

CTG GAG GAA CGA GTG AAG GAG GCT GGA ATT GAG ATT ACT TTC CGC CAG 672 

Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie Thr Phe Arg Gin 

210 215 220 

15 

AGT TTC TTC TCA GAT CCA GCT GTG CCC GTC AAA AAC CTG AAG CGC CAG 720 

Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 

225 230 235 240 

20 GAT GCC CGA ATC ATC GTG GGA CTT TTC TAT GAG ACT GAA GCC CGG AAA 768 
Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
245 250 255 

GTT TTT TGT GAG GTG TAC AAG GAG CGT CTC TTT GGG AAG AAG TAC GTC 816 
25 Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
260 265 270 

TGG TTC CTC ATT GGG TGG TAT GCT GAC AAT TGG TTC AAG ATC TAC GAC 864 
Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys lie Tyr Asp 
30 275 280 285 

CCT TCT ATC AAC TGC ACA GTG GAT GAG ATG ACT GAG GCG GTG GAG GGC 912 
Pro Ser lie Asn Cys Thr Val Asp Glu Met Thr Glu Ala Val Glu Gly 
290 295 300 

35 

CAC ATC ACA ACT GAG ATT GTC ATG CTG AAT CCT GCC AAT ACC CGC AGC 960 
His lie Thr Thr Glu lie Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
305 310 315 320 

40 ATT TCC AAC ATG ACA TCC CAG GAA TTT GTG GAG AAA CTA ACC AAG CGA 1008 
lie Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
325 330 *335 
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CTG AAA AGA CAC CCT GAG GAG ACA GGA GGC TTC CAG GAG GCA CCG CTG 105-6 
Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
340 345 350 

5 GCC TAT GAT GCC ATC TGG GCC TTG GCA CTG GCC CTG AAC AAG ACA TCT 1104 
Ala Tyr Asp Ala lie Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 
355 360 365 

GGA GGA GGC GGC CGT TCT GGT GTG CGC CTG GAG GAC TTC AAC TAC AAC 1152 
10 Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
370 375 380 

AAC CAG ACC ATT ACC GAC CAA ATC TAC CGG GCA ATG AAC TCT TCG TCC 1200 
Asn Gin Thr lie Thr Asp Gin lie Tyr Arg Ala Met Asn Ser Ser Ser 
15 385 390 395 400 

TTT GAG GGT GTC TCT GGC CAT GTG GTG TTT GAT GCC AGC GGC TCT CGG 1248 

Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
405 410 415 

20 

ATG GCA TGG ACG CTT ATC GAG CAG CTT CAG GGT GGC AGC TAC AAG AAG 1296 

Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
420 425 430 

25 ATT GGC TAC TAT GAC AGC ACC AAG GAT GAT CTT TCC TGG TCC AAA ACA 1344 
lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 

GAT AAA TGG ATT GGA GGG TCC CCC CCA GCT GAC CAG ACC CTG GTC ATC 1392 
30 Asp Lys Trp lie Gly Gly Ser Pro Pro Ala Asp Gin Thr Leu Val He 
450 455 460 

AAG ACA TTC CGC TTC CTG TCA CAG AAA CTC TTT ATC TCC GTC TCA GTT 1440 
Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe He Ser Val Ser Val 
35 465 470 475 480 

CTC TCC AGC CTG GGC ATT GTC CTA GCT GTT GTC TGT CTG TCC TTT AAC 1488 
Leu Ser Ser Leu Gly He Val Leu Ala Val Val Cys Leu Ser Phe Asn 
485 490 495 



40 



ATC TAC AAC TCA CAT GTC CGT TAT ATC CAG AAC TCA CAG CCC AAC CTG 1536 
He Tyr Asn Ser His Val Arg Tyr He Gin Asn Ser Gin Pro Asn Leu 
500 505 510 
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AAC AAC CTG ACT GCT GTG GGC TGC TCA CTG GCT TTA GCT GCT GTC TTC 1584 
Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
515 520 525 

CCC CTG GGG CTC GAT GGT TAC CAC ATT GGG AGG AAC CAG TTT CCT TTC 1632 
Pro Leu Gly Leu Asp Gly Tyr His lie Gly Arg Asn Gin Phe Pro Phe 
530 535 540 

GTC TGC CAG GCC CGC CTC TGG CTC CTG GGC CTG GGC TTT AGT CTG GGC 1680 
Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly 
545 550 555 560 

TAC GGT TCC ATG TTC ACC AAG ATT TGG TGG GTC CAC ACG GTC TTC ACA 172 8 

Tyr Gly Ser Met Phe Thr Lys He Trp Trp Val His Thr Val Phe Thr 
565 570 575 

AAG AAG GAA GAA AAG AAG GAG TGG AGG AAG ACT CTG GAA CCC TGG AAG 1776 
Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys 
580 585 590 

CTG TAT GCC ACA GTG GGC CTG CTG GTG GGC ATG GAT GTC CTC ACT CTC 1824 
Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
595 600 605 

GCC ATC TGG CAG ATC GTG GAC CCT CTG CAC CGG ACC ATT GAG ACA TTT 1872 
Ala He Trp Gin He Val Asp Pro Leu His Arg Thr He Glu Thr Phe 
610 615 620 

GCC AAG GAG GAA CCT AAG GAA GAT ATT GAC GTC TCT ATT CTG CCC CAG 1920 
Ala Lys Glu Glu Pro Lys Glu Asp He Asp Val Ser He Leu Pro Gin 
625 630 635 640 

CTG GAG CAT TGC AGC TCC AGG AAG ATG AAT ACA TGG CTT GGC ATT TTC 1968 
Leu Glu His Cys Ser Ser Arg Lys Met Asn Thr Trp Leu Gly He Phe 
645 650 655 

TAT GGT TAC AAG GGG CTG CTG CTG CTG CTG GGA ATC TTC CTT GCT TAT 2016 
Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly He Phe Leu Ala Tyr 
660 665 670 

GAG ACC AAG AGT GTG TCC ACT GAG AAG ATC AAT GAT CAC CGG GCT GTG 2064 
Glu Thr Lys Ser Val Ser Thr Glu Lys He Asn Asp His Arg Ala Val 
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675 680 685 ~ 

GGC ATG GCT ATC TAC AAT GTG GCA GTC CTG TGC CTC ATC ACT GCT CCT 2112 
Gly Met Ala lie Tyr Asn Val Ala Val Leu Cys Leu He Thr Ala Pro 
690 695 700 

GTC ACC ATG ATT CTG TCC AGC CAG CAG GAT GCA GCC TTT GCC TTT GCC 2160 
Val Thr Met He Leu Ser Ser Gin Gin Asp Ala Ala Phe Ala Phe Ala 
705 710 715 720 

TCT CTT GCC ATA GTT TTC TCC TCC TAT ATC ACT CTT GTT GTG CTC TTT 2208 
Ser Leu Ala He Val Phe Ser Ser Tyr He Thr Leu Val Val Leu Phe 
725 730 735 

GTG CCC AAG ATG CGC AGG CTG ATC ACC CGA GGG GAA TGG CAG TCG GAG 2256 
Val Pro Lys Met Arg Arg Leu He Thr Arg Gly Glu Trp Gin Ser Glu 
740 745 750 

GCG CAG GAC ACC ATG AAG ACA GGG TCA TCG ACC AAC AAC AAC GAG GAG 23 04 

Ala Gin Asp Thr Met Lys Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu 
755 760 765 

GAG AAG TCC CGG CTG TTG GAG AAG GAG AAC CGT GAA CTG GAA AAG ATC 2352 
Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu Leu Glu Lys He 
770 775 780 

ATT GCT GAG AAA GAG GAG CGT GTC TCT GAA CTG CGC CAT CAA CTC CAG 2400 
He Ala Glu Lys Glu Glu Arg Val Ser Glu Leu Arg His Gin Leu Gin 
785 790 795 800 

TCT CGG CAG CAG CTC CGC TCC CGG CGC CAC CCA CCG ACA CCC CCA GAA 2448 
Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro Pro Thr Pro Pro Glu 
805 810 815 

CCC TCT GGG GGC CTG CCC AGG GGA CCC CCT GAG CCC CCC GAC CGG CTT 2496 
Pro Ser Gly Gly Leu Pro Arg Gly Pro Pro Glu Pro Pro Asp Arg Leu 
820 825 830 



AGC TGT GAT GGG AGT CGA GTG CAT TTG CTT TAT AAG TGA 
Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr Lys * 
835 840 



2535 
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(2) INFORMATION FOR SEQ ID NO: 51: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 844 amino acids 
5 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO : 51: 

Met Gly Pro Gly Ala Pro Phe Ala Arg Val Gly Trp Pro Leu Pro Leu 
1 5 10 15 

15 Leu Val Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 
20 25 30 

Pro His Leu Pro Arg Pro His Ser Arg Val Pro Pro His Pro Ser Ser 
35 40 45 

20 

Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 

Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
25 65 70 75 80 

Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu 
85 90 95 

30 lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
100 105 110 

Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu Met Pro Gly 
115 120 125 

35 

Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
130 135 140 

Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
40 145 150 155 160 

Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr 'Leu His 
165 170 175 
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Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie 
180 185 190 

Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
195 200 205 

Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie Thr Phe Arg Gin 
210 215 220 

Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
225 230 235 240 

Asp Ala Arg He He Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
245 250 255 



Val Phe Cys Glu 
260 

Trp Phe Leu lie 
275 

Pro Ser He Asn 
290 

His He Thr Thr 
305 

He Ser Asn Met 



Val Tyr Lys Glu 



Gly Trp Tyr Ala 
280 

Cys Thr Val Asp 
295 

Glu He Val Met 
310 

Thr Ser Gin Glu 
325 



Arg Leu Phe Gly 
265 

Asp Asn Trp Phe 



Glu Met Thr Glu 
300 

Leu Asn Pro Ala 
315 

Phe Val Glu Lys 
330 



Lys Lys Tyr Val 
270 

Lys He Tyr Asp 
285 

Ala Val Glu Gly 



Asn Thr Arg Ser 
320 

Leu Thr Lys Arg 
335 



Leu Lys Arg His Pro Glu Glu Thr 
340 

Ala Tyr Asp Ala He Trp Ala Leu 

355 360 



Gly Gly Phe Gin Glu Ala Pro Leu 
345 350 

Ala Leu Ala Leu Asn Lys Thr Ser 
365 



Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
370 375 380 

Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala Met Asn Ser Ser Ser 
385 390 395 * 400 
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Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
405 410 415 

Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
5 420 425 430 

lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 

10 Asp Lys Trp lie Gly Gly Ser Pro Pro Ala Asp Gin Thr Leu Val lie 
450 455 460 

Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe lie Ser Val Ser Val 
465 470 475 480 

15 

Leu Ser Ser Leu Gly lie Val Leu Ala Val Val Cys Leu Ser Phe Asn 
485 490 495 

lie Tyr Asn Ser His Val Arg Tyr lie Gin Asn Ser Gin Pro Asn Leu 
20 500 505 510 

Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
515 520 525 

25 Pro Leu Gly Leu Asp Gly Tyr His lie Gly Arg Asn Gin Phe Pro Phe 
530 535 540 

Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly 
545 550 555 560 

30 

Tyr Gly Ser Met Phe Thr Lys lie Trp Trp Val His Thr Val Phe Thr 
565 570 575 

Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys 
35 580 585 590 

Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
595 600 605 

40 Ala lie Trp Gin lie Val Asp Pro Leu His Arg Thr lie Glu Thr Phe 
610 615 620 

Ala Lys Glu Glu Pro Lys Glu Asp lie Asp Val Ser lie Leu Pro Gin 
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625 630 635 640 

Leu Glu His Cys Ser Ser Arg Lys Met Asn Thr Trp Leu Gly He Phe 
645 650 655 

5 

Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly He Phe Leu Ala Tyr 
660 665 670 

Glu Thr Lys Ser Val Ser Thr Glu Lys He Asn Asp His Arg Ala Val 
10 675 680 685 

Gly Met Ala He Tyr Asn Val Ala Val Leu Cys Leu He Thr Ala Pro 
690 695 700 

15 Val Thr Met He Leu Ser Ser Gin Gin Asp Ala Ala Phe Ala Phe Ala 
705 710 715 720 

Ser Leu Ala He Val Phe Ser Ser Tyr He Thr Leu Val Val Leu Phe 
725 730 735 

20 

Val Pro Lys Met Arg Arg Leu He Thr Arg Gly Glu Trp Gin Ser Glu 
740 745 750 

Ala Gin Asp Thr Met Lys Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu 
25 755 760 765 

Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu Leu Glu Lys He 
770 775 780 

30 He Ala Glu Lys Glu Glu Arg Val Ser Glu Leu Arg His Gin Leu Gin 
785 790 795 800 

Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro Pro Thr Pro Pro Glu 
805 810 815 

35 

Pro Ser Gly Gly Leu Pro Arg Gly Pro Pro Glu Pro Pro Asp Arg Leu 
820 825 830 

Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr Lys 
40 835 840 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2895 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Canis familiaris 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION : 1 . .2895 



20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 



ATG CTG CTG CTG CTC CTG CCG CTG GCG CTG GCG CCG CTC TTC CTC CGC 48 
Met Leu Leu Leu Leu Leu Pro Leu Ala Leu Ala Pro Leu Phe Leu Arg 
25 1 5 10 15 

CCC CCG GGC GCG GGC GGG GCA CAG ACC CCC AAC GCC ACC TCG GAA GGT 96 
Pro Pro Gly Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly 
20 25 30 

30 

TGC CAG ATC ATA CAC CCG CCT TGG GAA GGG GGT ATC AGG TAC AGG GGC 144 
Cys Gin lie lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly 
35 40 45 

35 CTG ACT CGT GAC CAG GTG AAG GCT ATC AAC TTC CTG CCG GTG GAC TAT 192 
Leu Thr Arg Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr 
50 55 60 

GAG ATT GAG TAT GTG TGC CGG GGA GAG CGA GAG GTG GTG GGG CCC AAG 240 
40 Glu lie Glu Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys 
65 70 75 80 

GTC CGA AAG TGC CTG GCC AAT GGC TCC TGG ACA GAT ATG GAC ACA CCC 288 
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Val Arg Lys Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro 
85 90 95 

AGC CGC TGT GTC CGA ATC TGT TCC AAG TCA TAT TTG GCC CTG GAA AAT 33 6 

5 Ser Arg Cys Val Arg lie Cys Ser Lys Ser Tyr Leu Ala Leu Glu Asn 
100 105 110 

GGG AAG GTC TTC CTG ACG GGT GGG GAC CTC CCC GCT CTG GAT GGA GCC 3 84 

Gly Lys Val Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala 
10 115 120 125 

CGG GTG GAT TTC CGG TGT GAC CCT GAC TTC CAT CTT GTG GGC AGC TCC 432 

Arg Val Asp Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser 

130 135 140 

15 

CGG AGT ATC TGT AGT CAG GGC CAG TGG AGC ACT CCC AAG CCC CAC TGC 480 

Arg Ser lie Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys 
145 150 155 160 

20 CAG GTG AGC CGA ACG CCG CAC TCA GAG CGG CGA GCG GTG TAC ATC GGG 528 
Gin Val Ser Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly 
165 170 175 

GCG CTG TTT CCC ATG AGC GGG GGC TGG CCG GGG GGC CAG GCC TGC CAG 576 
25 Ala Leu Phe Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin 
180 185 190 

CCC GCG GTG GAG ATG GCG CTG GAG GAC GTG AAT AGC CGC AGG GAC ATC 624 
Pro Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie 
30 195 200 205 

CTG CCG GAC TAC GAG CTC AAG CTC ATC CAC CAC GAC AGC AAG TGT GAC 672 
Leu Pro Asp Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp 
210 215 220 

35 

CCA GGC CAA GCT ACC AAG TAC CTG TAT GAA CTG CTC TAC AAC GAC CCC 720 
Pro Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro 
225 230 235 240 

40 ATC AAG ATC ATC CTC ATG CCT GGC TGC AGC TCT GTC TCC ACG CTT GTG 768. 
lie Lys lie lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val 
245 250 '255 
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GCT GAG GCT GCC AGG ATG 
Ala Glu Ala Ala Arg Met 
260 

5 AGC TCA CCA GCT CTG TCC 
Ser Ser Pro Ala Leu Ser 
275 

ACT CAT CCC TCG GCC ACG 
10 Thr His Pro Ser Ala Thr 
290 



TGG AAC CTC ATT GTG CTC 
Trp Asn Leu lie Val Leu 
265 

AAC CGG CAG CGC TTT CCT 
Asn Arg Gin Arg Phe Pro 
280 

CTC CAC AAC CCT ACG CGA 
Leu His Asn Pro Thr Arg 
295 300 



PCT/SE98/01947 

TCC TAT GGT TCC 8:16 
Ser Tyr Gly Ser 
270 

ACC TTC TTC CGA 864 

Thr Phe Phe Arg 

285 

GTG AAG CTC TTT 912 
Val Lys Leu Phe 



GAG AAG TGG GGC 
Glu Lys Trp Gly 
15 305 

GTG TTC ACA TCG 
Val Phe Thr Ser 

20 

GGG ATT GAG ATT 
Gly He Glu He 
340 

25 CCT GTC AAG AAC 
Pro Val Lys Asn 
355 

TTC TAT GAG ACT 
30 Phe Tyr Glu Thr 
370 

CGG CTC TTT GGG 
Arg Leu Phe Gly 
35 3 85 

GAC AAT TGG TTC 
Asp Asn Trp Phe 

40 

GAG ATG ACC GAG 
Glu Met Thr Glu 
420 



TGG AGG AAG ATT GCC 
Trp Arg Lys He Ala 
310 

ACT CTG GAC GAC CTA 
Thr Leu Asp Asp Leu 
325 

ACT TTC CGC CAG AGC 
Thr Phe Arg Gin Ser 
345 

CTC AAG CGC CAG GAT 
Leu Lys Arg Gin Asp 
360 

GAA GCC CGG AAA GTG 
Glu Ala Arg Lys Val 
375 

AAG AAG TAT GTG TGG 
Lys Lys Tyr Val Trp 
390 

AAG ACC TAG GAC CCC 
Lys Thr Tyr Asp Pro 
405 

GCT GTG GAA GGC CAC 
Ala Val Glu Gly His 
425 



ACC ATC CAG CAG ACC 
Thr He Gin Gin Thr 
315 

GAG GAA CGA GTG AAG 
Glu Glu Arg Val Lys 
330 

TTC TTC TCA GAT CCT 
Phe Phe Ser Asp Pro 
350 

GCC CGA ATC ATC GTG 
Ala Arg He He Val 
365 

TTC TGT GAG GTA TAC 
Phe Cys Glu Val Tyr 
380 

TTC CTC ATT GGG TGG 
Phe Leu He Gly Trp 
395 

TCC ATC AAC TGC ACA 
Ser He Asn Cys Thr 
410 

ATC ACC ACT GAG ATT 
He Thr Thr Glu He 
430 



ACC GAG 960 
Thr Glu 
320 

GAG GCT 1008 

Glu Ala 

335 

GCC GTG 1056 
Ala Val 

GGA CTT 1104 
Gly Leu 

AAG GAG 1152 
Lys Glu 

TAT GCT 1200 
Tyr Ala 
400 

GTG GAT 1248 

Val Asp 

415 

GTC ATG 1296 
*Val Met 
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CTG AAC CCA GCC AAC ACC CGC AGC ATC TCC AAC ATG ACA TCC CAG GAG 1344 
Leu Asn Pro Ala Asn Thr Arg Ser lie Ser Asn Met Thr Ser Gin Glu 
435 440 445 

5 

TTT GTG GAG AAA CTG ACC AAG AGA CTC AAG AGA CAC CCT GAG GAG ACA 1392 
Phe Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr 
450 455 460 

10 GGC GGC TTC CAG GAG GCA CCG CTG GCC TAT GAT GCC ATC TGG GCC TTG 1440 
Gly Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala lie Trp Ala Leu 
465 470 475 480 

GCA TTG GCC CTG AAC AAG ACA TCT GGA GGG AGC GGC CGT TCG GGG GTG 1488 
15 Ala Leu Ala Leu Asn Lys Thr Ser Gly Gly Ser Gly Arg Ser Gly Val 

485 490 495 

CGC CTG GAA GAC TTC AAC TAC AAC AAC CAG ACG ATC ACA GAC CAA ATC 153 6 

Arg Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie 
20 500 505 510 

TAC CGC GCA ATG AAC TCC TCG TCC TTT GAG GGT GTC TCT GGC CAC GTG 1584 
Tyr Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val 
515 520 525 

GTG TTT GAT GCC AGC GGC TCA CGG ATG GCC TGG ACT CTG ATT GAG CAG 1632 
Val Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin 
530 535 540 

CTG CAG GGT GGC AGC TAC AAG AAG ATC GGC TAC TAT GAC AGC ACC AAG 1680 
Leu Gin Gly Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser Thr Lys 
545 550 555 560 

GAT GAC CTT TCC TGG TCT AAA ACG GAC AAA TGG ATT GGA GGG GCC CCC 1728 
35 Asp Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp He Gly Gly Ala Pro 

565 570 575 

CCG GCC GAC CAG ACC CTG GTC ATC AAG ACA TTT CGC TTC ATG TCA CAG 1776 
Pro Ala Asp Gin Thr Leu Val He Lys Thr Phe Arg Phe Met Ser Gin 
580 585 590 



25 



30 



40 



AAG CTC TTC ATT TCA GTC TCT GTC CTC TCC AGC CTG GGC ATT* GTC CTG 1824 
Lys Leu Phe He Ser Val Ser Val Leu Ser Ser Leu Gly He Val Leu 
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595 600 605 — 

GCT GTG GTC TGT CTG TCC TTT AAC ATC TAC AAC TCT CAT GTC CGT TAC 1872 

Ala Val Val Cys Leu Ser Phe Asn lie Tyr Asn Ser His Val Arg Tyr 
5 610 615 620 

ATC CAG AAC TCC CAG CCC AAC TTG AAC AAT CTG ACT GCT GTG GGC TGC 1920 

lie Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys 

625 630 635 640 

10 

TCC CTG GCA TTG GCT GCC GTC TTC CCC CTG GGG CTA GAT GGG TAC CAC 1968 

Ser Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His 
645 650 655 

15 ATC GGG AGA AGC CAG TTT CCT TTT GTG TGT CAG GCA CGC CTC TGG CTC 2016 

lie Gly Arg Ser Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu 
660 665 670 

CTG GGT CTG GGC TTC AGT CTG GGC TAT GGC TCC ATG TTC ACG AAG ATC 2064 

20 Leu Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys lie 
675 680 685 

TGG TGG GTC CAC ACG GTC TTC ACT AAG AAG GAG GAG AAG AAG GAG TGG 2112 

Trp Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp 
25 690 695 700 

AGG AAG ACC CTG GAG CCC TGG AAG CTG TAC ACC AGA GTG GGC TTG CTA 2160 

Arg Lys Thr Leu Glu Pro Trp Lys Leu Tyr Thr Thr Val Gly Leu Leu 

705 710 715 720 

30 

GTG GGC ATG GAT GTC CTC ACT CTT GCC ATT TGG CAG ATG GTA GAC CCC 22 08 

Val Gly Met Asp Val Leu Thr Leu Ala lie Trp Gin Met Val Asp Pro 
725 730 735 

35 TTG CAC CGG ACC ATT GAG ACT TTT GCC AAG GAG GAA CCA AAG GAA GAT 2256 
Leu His Arg Thr lie Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp 
740 745 750 

ATT GAT GTG TCC ATC CTG CCC CAG CTG GAG CAC TGC AGC TCC AAG AAA 2304 
40 lie Asp Val Ser lie Leu Pro Gin Leu Glu His Cys Ser Ser Lys Lys 
755 760 765 

ATG AAC ACC TGG CTT GGC ATT TTC TAT GGT TAC AAG GGG CTG CTG CTG 23 52 
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Met Asn Thr Trp Leu Gly lie Phe Tyr Gly Tyr Lys Gly Leu Leu Leu ~~ 
770 775 780 

CTG CTA GGC ATC TTT CTT GCT TAT GAG ACC AAG AGC GTG TCT ACT GAG 2400 
Leu Leu Gly lie Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu 
785 790 795 800 

AAG ATC AAT GAC CAC CGG GCT GTG GGC ATG GCC ATG TAC AAC GTG GCG 2448 
Lys He Asn Asp His Arg Ala Val Gly Met Ala Met Tyr Asn Val Ala 
805 810 815 

GTC CTG TGC CTC ATC ACT GCC CCG GTC ACC ATG ATC CTG TCC AGC CAG 2496 
Val Leu Cys Leu He Thr Ala Pro Val Thr Met He Leu Ser Ser Gin 
820 825 830 

CAG GAT GCA GCT TTC GCC TTT GCA GCT CTT GCC ATA GTG TTC TCC TCC 2544 
Gin Asp Ala Ala Phe Ala Phe Ala Ala Leu Ala He Val Phe Ser Ser 
835 840 845 

TAC ATC ACT CTG GTC GTT CTG TTC GTG CCG AAG ATG CGC AGG TTG ATC 2592 
Tyr He Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg Leu He 
850 855 860 

ACC CGG GGT GAG TGG CAG TCG GAG GCG CAG GAT ACC ATG AAA ACG GGG 2640 
Thr Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly 
865 870 875 880 

TCG TCG ACC AAC AAC AAT GAG GAA GAG AAG TCC CGA CTG TTG GAG AAG 2688 
Ser Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys 
885 890 895 

GAG AAC CGG GAG CTG GAG AAG ATC ATT GCT GAG AAA GAG GAG CGA GTG 273 6 

Glu Asn Arg Glu Leu Glu Lys He He Ala Glu Lys Glu Glu Arg Val 
900 905 910 

TCC GAG CTG CGC CAT CAG CTT CGT TCT CGG CAG CAG CTG CGC CCT CGG 2784 
Ser Glu Leu Arg His Gin Leu Arg Ser Arg Gin Gin Leu Arg Pro Arg 
915 920 925 

CGT CAC CCC CCG ACG CCC CCA GAC CCC TCA GGG GGC CTG CCC AGG GGA 2832 
Arg His Pro Pro Thr Pro Pro Asp Pro Ser Gly Gly Leu Pro Arg Gly 
930 935 940 
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CCC CAT GAG CCC CCT GAC CGG CTC AGC TGT GAC GGG AGC CGG GTT CAt 28^3"0 
Pro His Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His 
945 950 955 960 



5 TTG CTG TAC AAG TGA 
Leu Leu Tyr Lys * 
965 



10 (2) INFORMATION FOR SEQ ID NO: 53: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 964 amino acids 

(B) TYPE: amino acid 
15 (D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

20 

Met Leu Leu Leu Leu Leu Pro Leu Ala Leu Ala Pro Leu Phe Leu Arg 
15 10 15 

Pro Pro Gly Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly 
25 20 25 30 

Cys Gin lie lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly 
35 40 45 

30 Leu Thr Arg Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr 
50 55 60 



Glu lie Glu Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys 
65 70 75 80 

35 

Val Arg Lys Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro 
85 90 95 



Ser Arg Cys Val Arg lie Cys Ser Lys Ser Tyr Leu Ala Leu Glu Asn 
40 100 105 110 

Gly Lys Val Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala 
115 120 125 
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Arg Val Asp Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser 
130 135 140 

5 Arg Ser lie Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys 
145 150 155 160 

Gin Val Ser Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly 
165 170 175 

10 

Ala Leu Phe Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin 
180 185 190 

Pro Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie 
15 195 200 205 

Leu Pro Asp Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp 
210 215 220 

20 Pro Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro 
225 230 235 240 

lie Lys lie lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val 
245 250 255 

25 

Ala Glu Ala Ala Arg Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser 
260 265 270 

Ser Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg 
30 275 280 285 

Thr His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe 
290 295 300 

35 Glu Lys Trp Gly Trp Arg Lys lie Ala Thr lie Gin Gin Thr Thr Glu 
305 310 315 320 

Val Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala 
325 330 335 

40 

Gly lie Glu lie Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val 
340 345 350 



HI 865-1 WOSEQ 65 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 



PCT/SE98/01947 



Pro Val Lys Asn Leu Lys Arg Gin Asp Ala Arg lie lie Vai Gly Leu 
355 360 365 

Phe Tyr Glu Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu 
5 370 375 380 

Arg Leu Phe Gly Lys Lys Tyr Val Trp Phe Leu lie Gly Trp Tyr Ala 
385 390 395 400 

10 Asp Asn Trp Phe Lys Thr Tyr Asp Pro Ser lie Asn Cys Thr Val Asp 

405 410 415 

Glu Met Thr Glu Ala Val Glu Gly His lie Thr Thr Glu lie Val Met 
420 425 430 

15 

Leu Asn Pro Ala Asn Thr Arg Ser lie Ser Asn Met Thr Ser Gin Glu 
435 440 445 

Phe Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr 
20 450 455 460 

Gly Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala lie Trp Ala Leu 
465 470 475 480 

25 Ala Leu Ala Leu Asn Lys Thr Ser Gly Gly Ser Gly Arg Ser Gly Val 

485 490 495 

Arg Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie 
500 505 510 

30 

Tyr Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val 
515 520 525 

Val Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin 
35 530 535 540 

Leu Gin Gly Gly Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys 
545 550 555 560 

40 Asp Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp lie Gly Gly Ala Pro 

565 570 575 

Pro Ala Asp Gin Thr Leu Val lie Lys Thr Phe Arg Phe Met Ser Gin 
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580 

Lys Leu Phe lie Ser 
595 

5 

Ala Val Val Cys Leu 
610 

lie Gin Asn Ser Gin 
10 625 

Ser Leu Ala Leu Ala 
645 

15 lie Gly Arg Ser Gin 
660 

Leu Gly Leu Gly Phe 
675 

20 

Trp Trp Val His Thr 
690 

Arg Lys Thr Leu Glu 
25 705 

Val Gly Met Asp Val 
725 

30 Leu His Arg Thr lie 
740 

lie Asp Val Ser lie 
755 

35 

Met Asn Thr Trp Leu 
770 

Leu Leu Gly lie Phe 
40 785 

Lys lie Asn Asp His 
805 



585 

Val Ser Val Leu Ser 
600 

Ser Phe Asn lie Tyr 
615 

Pro Asn Leu Asn Asn 
630 

Ala Val Phe Pro Leu 
650 

Phe Pro Phe Val Cys 
665 

Ser Leu Gly Tyr Gly 
680 

Val Phe Thr Lys Lys 
695 

Pro Trp Lys Leu Tyr 
710 

Leu Thr Leu Ala lie 
730 

Glu Thr Phe Ala Lys 
745 

Leu Pro Gin Leu Glu 
760 

Gly lie Phe Tyr Gly 
775 

Leu Ala Tyr Glu Thr 
790 

Arg Ala Val Gly Met 
810 



590 

Ser Leu Gly lie Val Leu 
605 

Asn Ser His Val Arg Tyr 
620 

Leu Thr Ala Val Gly Cys 
635 640 

Gly Leu Asp Gly Tyr His 
655 

Gin Ala Arg Leu Trp Leu 
670 

Ser Met Phe Thr Lys lie 
685 

Glu Glu Lys Lys Glu Trp 
700 

Thr Thr Val Gly Leu Leu 
715 720 

Trp Gin Met Val Asp Pro 
735 

Glu Glu Pro Lys Glu Asp 
750 

His Cys Ser Ser Lys Lys 
765 

Tyr Lys Gly Leu Leu Leu 
780 

Lys Ser Val Ser Thr Glu 
795 800 

Ala Met Tyr Asn Val Ala 
815 
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Val Leu Cys Leu 
820 

Gin Asp Ala Ala 
835 

Tyr lie Thr Leu 
850 

Thr Arg Gly Glu 
865 

Ser Ser Thr Asn 



Glu Asn Arg Glu 
900 

Ser Glu Leu Arg 
915 

Arg His Pro Pro 
930 

Pro His Glu Pro 
945 

Leu Leu Tyr Lys 



lie Thr Ala Pro 



Phe Ala Phe Ala 
840 

Val Val Leu Phe 
855 

Trp Gin Ser Glu 
870 

Asn Asn Glu Glu 
885 

Leu Glu Lys lie 



His Gin Leu Arg 
920 

Thr Pro Pro Asp 
935 

Pro Asp Arg Leu 
950 



Val Thr Met lie 
825 

Ala Leu Ala lie 



Val Pro Lys Met 
860 

Ala Gin Asp Thr 
875 

Glu Lys Ser Arg 
890 

lie Ala Glu Lys 
905 

Ser Arg Gin Gin 



Pro Ser Gly Gly 
940 

Ser Cys Asp Gly 
955 



Leu Ser Ser Gin 
830 

Val Phe Ser Ser 
845 

Arg Arg Leu lie 



Met Lys Thr Gly 
880 

Leu Leu Glu Lys 
895 

Glu Glu Arg Val 
910 

Leu Arg Pro Arg 
925 

Leu Pro Arg Gly 



Ser Arg Val His 
960 



(2) INFORMATION FOR SEQ ID NO : 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1737 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
(iii) HYPOTHETICAL: NO 
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(iv) ANTI-SENSE: NO ~~ 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 



(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION : 1 . .1737 

10 (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 

15 10 15 

15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 

20 25 30 

20 ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
25 Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 
Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
30 6 5 7 0 7 5 8 0 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 288 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

35 

GTC CGA ATC TGC TCC AAG TCT TAT TTG ACC CTG GAA AAT GGG AAG GTT 336 

Val Arg lie Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val 
100 105 110 

40 TTC CTG ACG GGT GGG GAC CTC CCA GCT CTG GAC GGA GCC CGG GTG GAT 3 84. 

Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Asp 
115 120 125 



HI 865-1 WO SEQ 69 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 

TTC CGG TGT GAC CCC GAC TTC CAT CTG 

Phe Arg Cys Asp Pro Asp Phe His Leu 
130 135 

5 TGT AGT CAG GGC CAG TGG AGC ACC CCC 
Cys Ser Gin Gly Gin Trp Ser Thr Pro 
145 150 

CGA ACG CCA CAC TCA GAA CGG CGC GCA 
10 Arg Thr Pro His Ser Glu Arg Arg Ala 

165 



PCT/SE98/01947 

GTG GGC AGC TCC CGG AGC ATC 43T2 

Val Gly Ser Ser Arg Ser lie 
140 

AAG CCC CAC TGC CAG GTG AAT 480 

Lys Pro His Cys Gin Val Asn 
155 160 

GTG TAC ATC GGG GCA CTG TTT 528 
Val Tyr lie Gly Ala Leu Phe 
170 175 



CCC ATG AGC GGG GGC TGG CCA GGG GGC CAG GCC TGC CAG CCC GCG GTG 576 
Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val 
15 180 185 190 



GAG ATG GCG CTG GAG GAC GTG AAT AGC CGC AGG GAC ATC CTG CCG GAC 624 

Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp He Leu Pro Asp 

195 200 205 

20 

TAT GAG CTC AAG CTC ATC CAC CAC GAC AGC AAG TGT GAT CCA GGC CAA 672 

Tyr Glu Leu Lys Leu He His His Asp Ser Lys Cys Asp Pro Gly Gin 
210 215 220 



25 GCC ACC AAG TAC CTA TAT GAG CTG 
Ala Thr Lys Tyr Leu Tyr Glu Leu 
225 230 

ATC CTT ATG CCT GGC TGC AGC TCT 
30 He Leu Met Pro Gly Cys Ser Ser 

245 



CTC TAC AAC GAC CCT ATC AAG ATC 720 
Leu Tyr Asn Asp Pro He Lys He 
235 240 

GTC TCC ACG CTG GTG GCT GAG GCT 7 68 

Val Ser Thr Leu Val Ala Glu Ala 
250 255 



GCT AGG ATG TGG AAC CTC ATT GTG CTT TCC TAT GGC TCC AGC TCA CCA 816 

Ala Arg Met Trp Asn Leu He Val Leu Ser Tyr Gly Ser Ser Ser Pro 
35 260 265 270 

GCC CTG TCA AAC CGG CAG CGT TTC CCC ACT TTC TTC CGA ACG CAC CCA 864 

Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro 

275 280 285 

40 

TCA GCC ACA CTC CAC AAC CCT ACC CGC GTG AAA CTC TTT GAA AAG TGG 912 

Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp 
290 295 300 
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GGC TGG AAG AAG ATT GCT ACC ATC CAG 
Gly Trp Lys Lys He Ala Thr He Gin 
305 310 

TCG ACT CTG GAC GAC CTG GAG GAA CGA 
Ser Thr Leu Asp Asp Leu Glu Glu Arg 
325 



PCT/SE98/01947 

CAG ACC ACT GAG GTC TTC ACT 96 0 

Gin Thr Thr Glu Val Phe Thr 
315 320 

GTG AAG GAG GCT GGA ATT GAG 100 8 

Val Lys Glu Ala Gly He Glu 
330 335 



10 ATT ACT TTC CGC CAG AGT TTC TTC TCA GAT CCA GCT GTG CCC GTC AAA 105 6 

He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys 
340 345 350 

AAC CTG AAG CGC CAG GAT GCC CGA ATC ATC GTG GGA CTT TTC TAT GAG 1104 
15 Asn Leu Lys Arg Gin Asp Ala Arg He He Val Gly Leu Phe Tyr Glu 
355 360 365 

ACT GAA GCC CGG AAA GTT TTT TGT GAG GTG TAC AAG GAG CGT CTC TTT 1152 
Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe 
20 370 375 380 

GGG AAG AAG TAC GTC TGG TTC CTC ATT GGG TGG TAT GCT GAC AAT TGG 1200 

Gly Lys Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp 

385 390 395 400 

25 

TTC AAG ATC TAC GAC CCT TCT ATC AAC TGC AC A GTG GAT GAG ATG ACT 1248 

Phe Lys He Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu Met Thr 
405 410 415 

30 GAG GCG GTG GAG GGC CAC ATC ACA ACT GAG ATT GTC ATG CTG AAT CCT 1296 
Glu Ala Val Glu Gly His He Thr Thr Glu He Val Met Leu Asn Pro 
420 425 430 

GCC AAT ACC CGC AGC ATT TCC AAC ATG ACA TCC CAG GAA TTT GTG GAG 13 44 

35 Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe Val Glu 
435 440 445 

AAA CTA ACC AAG CGA CTG AAA AGA CAC CCT GAG GAG ACA GGA GGC TTC 13 92 

Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe 
40 450 455 460 - 

CAG GAG GCA CCG CTG GCC TAT GAT GCC ATC TGG GCC TTG GCA CTG GCC 1440 
Gin Glu Ala Pro Leu Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala 
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465 470 475 460 

CTG AAC AAG ACA TCT GGA GGA GGC GGC CGT TCT GGT GTG CGC CTG GAG 1488 
Leu Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu 
485 490 495 

GAC TTC AAC TAC AAC AAC CAG ACC ATT ACC GAC CAA ATC TAC CGG GCA 153 6 

Asp Phe Asn Tyr Asn Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala 
500 505 510 

ATG AAC TCT TCG TCC TTT GAG GGT GTC TCT GGC CAT GTG GTG TTT GAT 1584 
Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val Phe Asp 
515 520 525 

GCC AGC GGC TCT CGG ATG GCA TGG ACG CTT ATC GAG CAG CTT CAG GGT 1632 
Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly 
530 535 540 

GGC AGC TAC AAG AAG ATT GGC TAC TAT GAC AGC ACC AAG GAT GAT CTT 1680 
Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu 
545 550 555 560 

TCC TGG TCC AAA ACA GAT AAA TGG ATT GTT ATA TCC AGA ACT CAC AGC 1728 
Ser Trp Ser Lys Thr Asp Lys Trp He Val He Ser Arg Thr His Ser 
565 570 575 



CCA ACC TGA 
Pro Thr * 



1737 



(2) INFORMATION FOR SEQ ID NO : 55: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 57 8 amino acids 

(B) TYPE: amino acid 
( D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 55: 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
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1 5 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 ■ 25 30 

5 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
10 50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

15 Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 

85 90 95 

Val Arg lie Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val 
100 105 110 

20 

Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Asp 
115 120 125 

Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser Arg Ser lie 
25 130 135 140 

Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys Gin Val Asn 
145 150 155 160 

30 Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe 

165 170 175 

Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val 
180 185 190 

35 

Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp 
195 200 205 

Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp Pro Gly Gin 
40 210 215 220 

Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie 
225 230 235 240 
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lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala 
245 250 255 

5 Ala Arg Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro 
260 265 270 

Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro 
275 280 285 

10 

Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp 
290 295 300 

Gly Trp Lys Lys lie Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr 
15 305 310 315 320 

Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu 
325 330 335 

20 lie Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys 
340 345 350 

Asn Leu Lys Arg Gin Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu 
355 360 365 

25 

Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe 
370 375 380 

Gly Lys Lys Tyr Val Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp 
30 385 390 395 400 

Phe Lys lie Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu Met Thr 
405 410 415 

35 Glu Ala Val Glu Gly His He Thr Thr Glu He Val Met Leu Asn Pro 
420 425 430 

Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe Val Glu 
435 440 445 

40 

Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe 
450 455 460 
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Gin Glu Ala Pro 
465 

Leu Asn Lys Thr 



Asp Phe Asn Tyr 
500 

Met Asn Ser Ser 
515 

Ala Ser Gly Ser 
530 

Gly Ser Tyr Lys 
545 

Ser Trp Ser Lys 



Pro Thr 



Leu Ala Tyr Asp 
470 

Ser Gly Gly Gly 
485 

Asn Asn Gin Thr 



Ser Phe Glu Gly 
520 

Arg Met Ala Trp 
535 

Lys lie Gly Tyr 
550 

Thr Asp Lys Trp 
565 



Ala lie Trp Ala 
475 

Gly Arg Ser Gly 
490 

lie Thr Asp Gin 
505 

Val Ser Gly His 



Thr Leu lie Glu 
540 

Tyr Asp Ser Thr 
555 

lie Val He Ser 
570 



Leu Ala Leu Ala 
480 

Val Arg Leu Glu 
495 

He Tyr Arg Ala 
510 

Val Val Phe Asp 
525 

Gin Leu Gin Gly 



Lys Asp Asp Leu 
560 

Arg Thr His Ser 
575 



(2) INFORMATION FOR SEQ ID NO : 56: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13 86 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 
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(A) NAME /KEY : CDS "~ 

(B) LOCATION : 1 . .1386 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 56: 

ATG GGG CCC GGG GCC CCT TTT GCC CGG GTG GGG TGG CCA CTG CCG CTT 48 
Met Gly Pro Gly Ala Pro Phe Ala Arg Val Gly Trp Pro Leu Pro Leu 
15 10 15 

CTG GTT GTG ATG GCG GCA GGG GTG GCT CCG GTG TGG GCC TCC CAC TCC 9 6 

Leu Val Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 
20 25 30 

CCC CAT CTC CCG CGG CCT CAC TCG CGG GTC CCC CCG CAC CCC TCC TCA 144 
Pro His Leu Pro Arg Pro His Ser Arg Val Pro Pro His Pro Ser Ser 
35 40 45 

GAA CGG CGC GCA GTG TAC ATC GGG GCA CTG TTT CCC ATG AGC GGG GGC 192 
Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 

TGG CCA GGG GGC CAG GCC TGC CAG CCC GCG GTG GAG ATG GCG CTG GAG 24 
Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
65 70 75 80 

GAC GTG AAT AGC CGC AGG GAC ATC CTG CCG GAC TAT GAG CTC AAG CTC 28: 
Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu 
85 90 95 

ATC CAC CAC GAC AGC AAG TGT GAT CCA GGC CAA GCC ACC AAG TAC CTA 33 
lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
100 105 110 

TAT GAG CTG CTC TAC AAC GAC CCT ATC AAG ATC ATC CTT ATG CCT GGC 3 8< 

Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu Met Pro Gly 
115 120 125 

TGC AGC TCT GTC TCC ACG CTG GTG GCT GAG GCT GCT AGG ATG TGG AAC 43 
Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
130 135 140 



CTC ATT GTG CTT TCC TAT GGC TCC AGC TCA CCA GCC CTG TCA* AAC CGG 
Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 



48 
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145 150 

CAG CGT TTC CCC ACT TTC TTC CGA ACG 
Gin Arg Phe Pro Thr Phe Phe Arg Thr 
5 165 



PCT/SE98/01947 

155 160 — 

CAC CCA TCA GCC AC A CTC CAC 528 
His Pro Ser Ala Thr Leu His 
170 175 



10 



AAC CCT ACC CGC GTG AAA CTC TTT GAA AAG TGG GGC TGG AAG AAG ATT 576 
Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie 
180 185 190 

GCT ACC ATC CAG CAG ACC ACT GAG GTC TTC ACT TCG ACT CTG GAC GAC 624 
Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
195 200 205 

15 CTG GAG GAA CGA GTG AAG GAG GCT GGA ATT GAG ATT ACT TTC CGC CAG 672 
Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie Thr Phe Arg Gin 
210 215 220 

AGT TTC TTC TCA GAT CCA GCT GTG CCC GTC AAA AAC CTG AAG CGC CAG 720 
20 Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
225 230 235 240 

GAT GCC CGA ATC ATC GTG GGA CTT TTC TAT GAG ACT GAA GCC CGG AAA 768 
Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
25 245 250 255 

GTT TTT TGT GAG GTG TAC AAG GAG CGT CTC TTT GGG AAG AAG TAC GTC 816 

Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
260 265 270 

30 

TGG TTC CTC ATT GGG TGG TAT GCT GAC AAT TGG TTC AAG ATC TAC GAC 864 

Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys lie Tyr Asp 
275 280 285 

35 CCT TCT ATC AAC TGC ACA GTG GAT GAG ATG ACT GAG GCG GTG GAG GGC 912 
Pro Ser lie Asn Cys Thr Val Asp Glu Met Thr Glu Ala Val Glu Gly 
290 295 300 

CAC ATC ACA ACT GAG ATT GTC ATG CTG AAT CCT GCC AAT ACC CGC AGC 960 
40 His lie Thr Thr Glu lie Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
305 310 315 320 



ATT TCC AAC ATG ACA TCC CAG GAA TTT GTG GAG AAA CTA ACC AAG CGA 



1008 
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lie Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys' Arg — 
325 330 335 

CTG AAA AGA CAC CCT GAG GAG ACA GGA GGC TTC CAG GAG GCA CCG CTG 1056 
Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
340 345 350 

GCC TAT GAT GCC ATC TGG GCC TTG GCA CTG GCC CTG AAC AAG ACA TCT 1104 
Ala Tyr Asp Ala lie Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 
355 360 365 

GGA GGA GGC GGC CGT TCT GGT GTG CGC CTG GAG GAC TTC AAC TAC AAC 1152 
Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
370 375 380 

AAC CAG ACC ATT ACC GAC CAA ATC TAC CGG GCA ATG AAC TCT TCG TCC 1200 
Asn Gin Thr lie Thr Asp Gin lie Tyr Arg Ala Met Asn Ser Ser Ser 
385 390 395 400 

TTT GAG GGT GTC TCT GGC CAT GTG GTG TTT GAT GCC AGC GGC TCT CGG 1248 
Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
405 410 415 

ATG GCA TGG ACG CTT ATC GAG CAG CTT CAG GGT GGC AGC TAC AAG AAG 1296 
Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
420 425 430 

ATT GGC TAC TAT GAC AGC ACC AAG GAT GAT CTT TCC TGG TCC AAA ACA 1344 
lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 

GAT AAA TGG ATT GTT ATA TCC AGA ACT CAC AGC CCA ACC TGA 13 86 

Asp Lys Trp lie Val lie Ser Arg Thr His Ser Pro Thr * 
450 455 460 



(2) INFORMATION FOR SEQ ID NO : 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

Met Gly Pro Gly Ala Pro Phe Ala Arg Val Gly Trp Pro Leu Pro Leu 
5 1 5 10 15 

Leu Val Val Met Ala Ala Gly Val Ala Pro Val Trp Ala Ser His Ser 
20 25 * 30 

10 Pro His Leu Pro Arg Pro His Ser Arg Val Pro Pro His Pro Ser Ser 
35 40 45 

Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
50 55 60 

15 

Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
65 70 75 80 

Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu 
20 85 90 95 

lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
100 105 110 

25 Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu Met Pro Gly 
115 120 125 

Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
130 135 140 

30 

Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
145 150 155 160 

Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His 
35 1 65 170 1 75 

Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie 
180 185 190 

40 Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
195 200 205 

Leu Glu Glu Arg Val Lys Glu Ala Gly He Glu He Thr Phe Arg Gin 
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Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
225 230 235 240 

5 

Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
245 250 255 

Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
10 260 265 270 

Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys lie Tyr Asp 
275 280 285 

15 Pro Ser lie Asn Cys Thr Val Asp Glu Met Thr Glu Ala Val Glu Gly 
290 295 300 

His lie Thr Thr Glu lie Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
305 310 315 320 

20 

He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
325 330 335 

Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
25 340 345 350 

Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 
355 360 365 

30 Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
370 375 380 

Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala Met Asn Ser Ser Ser 
385 390 395 400 

35 

Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
405 410 415 

Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
40 420 425 430 

He Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
435 440 445 
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15 



40 
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Asp Lys Trp lie Val lie Ser Arg Thr His Ser Pro Thr 
450 455 460 



(2) INFORMATION FOR SEQ ID NO: 58; 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1746 base pairs 
10 (B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 



(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

20 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Canis familiaris 

(ix) FEATURE: 

(A) NAME /KEY : CDS 
25 (B) LOCATION: 1. .1746 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58: 

ATG CTG CTG CTG CTC CTG CCG CTG GCG CTG GCG CCG CTC TTC CTC CGC 4 8 

30 Met Leu Leu Leu Leu Leu Pro Leu Ala Leu Ala Pro Leu Phe Leu Arg 
15 10 15 

CCC CCG GGC GCG GGC GGG GCA CAG ACC CCC AAC GCC ACC TCG GAA GGT 96 
Pro Pro Gly Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly 
35 20 25 30 

TGC CAG ATC ATA CAC CCG CCT TGG GAA GGG GGT ATC AGG TAC AGG GGC 144 
Cys Gin lie He His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly 
35 40 45 



CTG ACT CGT GAC CAG GTG AAG GCT ATC AAC TTC CTG CCG GTG GAC TAT 192 
Leu Thr Arg Asp Gin Val Lys Ala He Asn Phe Leu Pro Val "Asp Tyr 
50 55 60 
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GAG ATT GAG TAT GTG TGC CGG GGA GAG CGA GAG GTG GTG GGG CCC AAG 240 

Glu lie Glu Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys 
65 70 75 80 

5 

GTC CGA AAG TGC CTG GCC AAT GGC TCC TGG ACA GAT ATG GAC ACA CCC 288 

Val Arg Lys Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro 
85 90 95 

10 AGC CGC TGT GTC CGA ATC TGT TCC AAG TCA TAT TTG GCC CTG GAA AAT 33 6 

Ser Arg Cys Val Arg lie Cys Ser Lys Ser Tyr Leu Ala Leu Glu Asn 
100 105 110 

GGG AAG GTC TTC CTG ACG GGT GGG GAC CTC CCC GCT CTG GAT GGA GCC 384 

15 Gly Lys Val Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala 
115 120 125 

CGG GTG GAT TTC CGG TGT GAC CCT GAC TTC CAT CTT GTG GGC AGC TCC 432 

Arg Val Asp Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser 

20 130 135 140 

CGG AGT ATC TGT AGT CAG GGC CAG TGG AGC ACT CCC AAG CCC CAC TGC 480 

Arg Ser lie Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys 
145 150 155 160 

25 

CAG GTG AGC CGA ACG CCG CAC TCA GAG CGG CGA GCG GTG TAC ATC GGG 528 

Gin Val Ser Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr He Gly 
165 170 175 

30 GCG CTG TTT CCC ATG AGC GGG GGC TGG CCG GGG GGC CAG GCC TGC CAG 576 

Ala Leu Phe Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin 
180 185 190 

CCC GCG GTG GAG ATG GCG CTG GAG GAC GTG AAT AGC CGC AGG GAC ATC 624 

35 Pro Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp He 
195 200 205 

CTG CCG GAC TAC GAG CTC AAG CTC ATC CAC CAC GAC AGC AAG TGT GAC 672 
Leu Pro Asp Tyr Glu Leu Lys Leu He His His Asp Ser Lys Cys Asp 

40 210 215 220 

CCA GGC CAA GCT ACC AAG TAC CTG TAT GAA CTG CTC TAC AAC GAC CCC 720 

Pro Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro 



HI 865-1 WOSEQ 82 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 



PCT/SE98/01947 



225 

ATC AAG ATC ATC 
lie Lys lie lie 

5 

GCT GAG GCT GCC 
Ala Glu Ala Ala 
260 

10 

AGC TCA CCA GCT 
Ser Ser Pro Ala 
275 

15 ACT CAT CCC TCG 
Thr His Pro Ser 
290 

GAG AAG TGG GGC 
20 Glu Lys Trp Gly 
305 

GTG TTC ACA TCG 
Val Phe Thr Ser 

25 

GGG ATT GAG ATT 
Gly He Glu He 
340 

30 

CCT GTC AAG AAC 
Pro Val Lys Asn 
355 

35 TTC TAT GAG ACT 
Phe Tyr Glu Thr 
370 

CGG CTC TTT GGG 
40 Arg Leu Phe Gly 
385 

GAC AAT TGG TTC 



230 

CTC ATG CCT GGC TGC 
Leu Met Pro Gly Cys 
245 

AGG ATG TGG AAC CTC 
Arg Met Trp Asn Leu 
265 

CTG TCC AAC CGG CAG 
Leu Ser Asn Arg Gin 
280 

GCC ACG CTC CAC AAC 
Ala Thr Leu His Asn 
295 

TGG AGG AAG ATT GCC 
Trp Arg Lys He Ala 
310 

ACT CTG GAC GAC CTA 
Thr Leu Asp Asp Leu 
325 

ACT TTC CGC CAG AGC 
Thr Phe Arg Gin Ser 
345 

CTC AAG CGC CAG GAT 
Leu Lys Arg Gin Asp 
360 

GAA GCC CGG AAA GTG 
Glu Ala Arg Lys Val 
375 

AAG AAG TAT GTG TGG 
Lys Lys Tyr Val Trp 
390 

AAG ACC TAC GAC CCC 



235 

AGC TCT GTC TCC ACG 
Ser Ser Val Ser Thr 
250 

ATT GTG CTC TCC TAT 
lie Val Leu Ser Tyr 
270 

CGC TTT CCT ACC TTC 
Arg Phe Pro Thr Phe 
285 

CCT ACG CGA GTG AAG 
Pro Thr Arg Val Lys 
300 

ACC ATC CAG CAG ACC 
Thr He Gin Gin Thr 
315 

GAG GAA CGA GTG AAG 
Glu Glu Arg Val Lys 
330 

TTC TTC TCA GAT CCT 
Phe Phe Ser Asp Pro 
350 

GCC CGA ATC ATC GTG 
Ala Arg He lie Val 
365 

TTC TGT GAG GTA TAC 
Phe Cys Glu Val Tyr 
380 

TTC CTC ATT GGG TGG 
Phe Leu He Gly Trp 
395 

TCC ATC AAC TGC ACA 



240 — 

CTT GTG 768 

Leu Val 

255 

GGT TCC 816 
Gly Ser 

TTC CGA 864 
Phe Arg 

CTC TTT 912 
Leu Phe 

ACC GAG 960 
Thr Glu 
320 

GAG GCT 1008 

Glu Ala 

335 

GCC GTG 1056 
Ala Val 

GGA CTT 1104 
Gly Leu 

AAG GAG 1152 
Lys Glu 

TAT GCT 1200 
Tyr Ala 
400 

GTG GAT 1248 
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Asp Asn Trp Phe Lys Thr Tyr Asp Pro Ser lie Asn Cys Thr Val Asp — 
405 410 415 

GAG ATG ACC GAG GCT GTG GAA GGC CAC ATC ACC ACT GAG ATT GTC ATG 1296 
Glu Met Thr Glu Ala Val Glu Gly His He Thr Thr Glu He Val Met 
420 425 430 

CTG AAC CCA GCC AAC ACC CGC AGC ATC TCC AAC ATG ACA TCC CAG GAG 13 44 

Leu Asn Pro Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu 
435 440 445 

TTT GTG GAG AAA CTG ACC AAG AGA CTC AAG AGA CAC CCT GAG GAG ACA 13 92 

Phe Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr 
450 455 460 

GGC GGC TTC CAG GAG GCA CCG CTG GCC TAT GAT GCC ATC TGG GCC TTG 1440 
Gly Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala He Trp Ala Leu 
465 470 475 480 

GCA TTG GCC CTG AAC AAG ACA TCT GGA GGG AGC GGC CGT TCG GGG GTG 1488 
Ala Leu Ala Leu Asn Lys Thr Ser Gly Gly Ser Gly Arg Ser Gly Val 
485 490 495 

CGC CTG GAA GAC TTC AAC TAC AAC AAC CAG ACG ATC ACA GAC CAA ATC 153 6 

Arg Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr He Thr Asp Gin He 
500 505 510 

TAC CGC GCA ATG AAC TCC TCG TCC TTT GAG GGT GTC TCT GGC CAC GTG 1584 
Tyr Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val 
515 520 525 

GTG TTT GAT GCC AGC GGC TCA CGG ATG GCC TGG ACT CTG ATT GAG CAG 1632 
Val Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He Glu Gin 
530 535 540 

CTG CAG GGT GGC AGC TAC AAG AAG ATC GGC TAC TAT GAC AGC ACC AAG 1680 
Leu Gin Gly Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser Thr Lys 
545 550 555 560 

GAT GAC CTT TCC TGG TCT AAA ACG GAC AAA TGG ATT GTT ACA TCC AGA 1728 
Asp Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp He Val Thr Ser Arg 
565 570 *575 
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ACT CCC AGC CCA ACT TGA 
Thr Pro Ser Pro Thr * 
580 



PCT/SE98/01947 



1745 



5 

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 581 amino acids 
10 (B) TYPE: amino acid 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

Met Leu Leu Leu Leu Leu Pro Leu Ala Leu Ala Pro Leu Phe Leu Arg 
15 10 15 

20 

Pro Pro Gly Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly 
20 25 30 

Cys Gin lie lie His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly 
25 3 5 40 45 

Leu Thr Arg Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr 
50 55 60 

30 Glu He Glu Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys 
65 70 75 80 

Val Arg Lys Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro 
85 90 95 

35 

Ser Arg Cys Val Arg He Cys Ser Lys Ser Tyr Leu Ala Leu Glu Asn 
100 105 110 

Gly Lys Val Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala 
40 115 120 125 

Arg Val Asp Phe Arg Cys Asp Pro Asp Phe His Leu Val Gly Ser Ser 
130 135 140 
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Arg Ser lie Cys Ser Gin Gly Gin Trp Ser Thr Pro Lys Pro His Cys 
145 150 155 160 

5 Gin Val Ser Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly 

165 170 175 

Ala Leu Phe Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin 
180 185 190 

10 

Pro Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie 
195 200 205 

Leu Pro Asp Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp 
15 210 215 220 

Pro Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro 
225 230 235 240 

20 lie Lys He He Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val 

245 250 255 

Ala Glu Ala Ala Arg Met Trp Asn Leu He Val Leu Ser Tyr Gly Ser 
260 265 270 

25 

Ser Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg 
275 280 285 

Thr His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe 
30 2 9 0 2 9 5 3 0 0 

Glu Lys Trp Gly Trp Arg Lys He Ala Thr He Gin Gin Thr Thr Glu 
305 310 315 320 

35 Val Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala 

325 330 335 

Gly He Glu He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val 
340 345 350 

40 

Pro Val Lys Asn Leu Lys Arg Gin Asp Ala Arg He He Val Gly Leu 
355 360 365 
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Phe Tyr Glu Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu 
370 375 3 80 

Arg Leu Phe Gly Lys Lys Tyr Val Trp Phe Leu lie Gly Trp Tyr Ala 
5 385 390 395 400 

Asp Asn Trp Phe Lys Thr Tyr Asp Pro Ser lie Asn Cys Thr Val Asp 
405 410 415 

10 Glu Met Thr Glu Ala Val Glu Gly His lie Thr Thr Glu lie Val Met 
420 425 430 

Leu Asn Pro Ala Asn Thr Arg Ser lie Ser Asn Met Thr Ser Gin Glu 
435 440 445 

15 

Phe Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr 
450 455 460 

Gly Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala lie Trp Ala Leu 
20 465 470 475 480 

Ala Leu Ala Leu Asn Lys Thr Ser Gly Gly Ser Gly Arg Ser Gly Val 
485 490 495 

25 Arg Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie 
500 505 510 

Tyr Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val 
515 520 525 

30 

Val Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin 
530 535 540 

Leu Gin Gly Gly Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys 
35 545 550 555 560 

Asp Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp lie Val Thr Ser Arg 
565 570 575 



40 Thr Pro Ser Pro Thr 
580 
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((2) INFORMATION FOR SEQ ID NO : 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16862 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(iv) ANTI -SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 3415 . .3440 

(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 3 441. .3903 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 39 04 . .3988 

(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 3 989 . .4689 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 4690 . .4893 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 4894. .5645 

(ix) FEATURE: 

( A ) NAME / KEY : exon 
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(B) LOCATION: 5646 . .5831 

(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 5832. .7181 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 7182 . .7202 

(ix) FEATURE: 

(A) NAME/ KEY : intron 

(B) LOCATION: 7203 . . 8307 

(ix) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 8308 . . 8803 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 8804 . . 12266 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 12267 .. 12401 

(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 12402 . .12815 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 12 81 6. .12986 

(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 12 987 . . 14085 

(ix) FEATURE: 

(A) NAME/ KEY: exon 

(B) LOCATION: 14086. .14187 

(ix) FEATURE: 



HI 865-1 WO SEQ 



89 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 



PCT/SE98/01947 



(A) NAME /KEY : intron 

(B) LOCATION: 14188 . . 14473 

(ix) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 14474. .14539 

(ix) FEATURE: 

(A) NAME / KEY : intron 

(B) LOCATION: 14540. .14998 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 14999 . . 15190 

(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 15191. .16862 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60: 

GATCATATTA ATTTGAAGGT GGCGGGGCAG GATGGTTCTG TGGTGCAGTT TAAGATTAAG 60 

AGGCATACAC CACTTAGTAA ACTAATGAAA GCCTATTGTG AACGACAGGG ATTGTCAATG 120 

AGGCAGATCA GATTCCGATT CGACGGGCAA CCAATGAAAC AGACACACCT GCACAGTTGG 180 

AAATGGAGGA TGAAGATACA ATTGATGTGT TCCAACAGCA GACGGGAGGT GTCTACTGAA 240 

AAGGGAACCT GCTTCTTTAC TCCAGAACTC TGTTCTTTAA AGACCAAGAT TACATTCTCA 300 

ATTAGAAAAC TGCAATTTGC TTCCACCACA TCCTGACTAC TACCGTATAG TTTTCTCTAT 3 60 

TCTTTCATTT CCCCCTTCCC CATTCCTTTA CTGTACATAA AGTAACTGGT ATATGTGCAC 420 

AAGCATATTA CTTTTTTTTT TTAAAACTAA ACAGCCAATG GTATGTTTTG ATTGACATCA 480 

AGTGGAGACG GGGGGGAAAA TACTGATTCT GTGAAAATAC CCCCTTTCTC CATTAGTGGC 540 

ATGCTCATTC AGCTCTTATC TTTATATTCC AGTAAGTTAT TTTGCTCTCA CTGTTTTAAC 600 

AACAACAACA AAAAAACAAC AACATAAAAA TCCTTGCATA CCTTGTTCAA TTGGAGAATT 660 
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TTAATGTTTT TCATTTATCA TTGTAAAACC AAGGACAATT TTATAACTTT TTTGTACTTA 72 0 

GCTGTTACAT GCAGAGCAAT CTGTCTTTAA GTAGGGATAA ATTACTCTAA AACAAAAAAG 780 

AATCCTAGAT AGTTTTCCCT TCAAGTCAAG CGTCTTGTTG TTTAAATAAA CTTCTTGTTT 840 

AAAAAAAAAA AAAGTAAAAA AGAAAAGTTA TGCAACAATT AATGGCCCAG AGGCAATCCT 900 
TGTTAACATT TTGATGCATC TTTTAGCTGT TTTTTTTTTT TTTTTTTTTT TTGACTGAGT 960 

TTGACTCTTG TCACCCAGGC TGAAGTGCAA TGGCATGGCA TGATCTTGGC TCACTGCAAC 1020 

CTCCGCCTCC CGGGTTCAAG TGATTCTCCT GCCTCAGCCT CCTGAGTAGC TAGGATTACG 1080 

GGCATGCACC ACCATGCCTG GCTAATTTTG TATTTTTAGT AGAGTTGGGG CTTCTCCACA 1140 

CTGGTCAGGC TGGTCTCGAA CTCCCAACCT CAGGTGATAA GGGAAGGGGC ACTATTGACA 1200 

TTTATGGTTG GGGCAGAGGT GTAAGATATT CTTCAAAGCA CTACCTACAT GTTGAAGAAT 1260 

TGTTCCTCAC CCAGATTCTC AAAAGTCCCC CAGGACATTC ACGTAGTGAA AACCTGTGTT 1320 

TAATTATCTG AGCCTATAAC TTAATACAGT TTTAAAATTT TTTTTTAAAT ATACAGTGAA 1380 

CTTTCTAGGA ATGCAATTAT AGTTGTGTGT AAAATTAGGG AAAATTAACT TTGCTACCAA 1440 

GAGTTGTTCA ACATTTTGTT AAATCACTTC ATTGATGGCA AC ATGC TGG A GGTAGTTGAG 1500 

TCACCAACTC AGCACCTGGA TCAGCCTGTG TTGGTAGCAG TTTCATCCCC GTGGTTCTGT 1560 

GAATAGGTGG AAGCATCTGC TTACTCCATC AGGACTTCTA GGGTAGTCGG GCCTTGGCAC 1620 

TCACACATTA AAATACTGTT TATGTTATTT TATTGCAAGT TACTTTTCTT TCATTTCCCC 1680 

TTTACGTTAC AGAAAGGGAA GCATTTTGCT TTCTGTTTAA AGTTGTGTAT GTAGGTAGGT 1740 

TATATCATCT AWGACTTTCT CTCCCTCCTT CCCTTTCTTT TTGTTTGAGA TGGAGTCTTG 1800 

CTCTGTCACC CAGGCTGGAG TGCAGTGGTG CGATCTTGGC TCACTGCAAC CTCTGCCTCC 1860^ 

CGGGTTCAAG CGATTCTGGT GTCTCAGCTG GGATTACAGG CGCACACCAT CACACCACGC 1920 
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TAATTTTTCT ATTTTTAGTA GAGATGGGGT TTCGCCATGC TGGCCAGGCC AGGCTGGTCT 198TJ 

CAAACTCCTG AGCTCAAGTG ATCAGTCCGC CTCGGCCTCC CAAAGTTCTG GGATTTCAGG 2040 

CGTGAGCCTC ATCTATGAAT CTCAATTTAG GACAGTAAAA GTGTCATTAC AAAAATATTT 2100 

ATTGTAAAAA AGGGTTGGAG GTTGAGAATC TCAATTCTAG TCAGTCTCTC AGTGTTTGGT 2160 

TTCTTCCTAC CATTTTTCCC CCTAGGACCA GCCAGAAAGC AGCTTTTTTT TTGTCCCCCC 2220 

CAACAAGGAG CCCACTGTTT CCTCTCCCAG CCCAAACTCA GGCCTACGAA CAACAACAGC 2280 

ACTACACACA CACACACACA CACACACACA CACACACACA CACCCCTCCA CTTCAAGGTA 2340 

TAGCCAAGAG CTTCTGGAGC CGTCAAAAAG GTCTGTACCT GCTGTCTTTA GAGCTTCCAG 2400 

TTTGCCCTTG GTCAAGAAAT ACTGTTTGCT AGGCTCTGCT GGAGTACATC AGGTAATACT 2460 

GGCTTCTAAA CCACCCTGAG GTTCTTTTCT CTTGTCCTTT TACTCCCTTC GTACTTCAAT 2520 

TTCTCTCCTT GATGTCCCCC TCCCTGTTTT GTTTTTTGCC TCCAATCCGT TCTGCGCGTT 2580 

CCCTGCAGAG CAGGCGAGTA GCAATGCTGC TGGACCATGG AGCTGCTCTA GTCTCCCAGA 2640 

AATCTCTTCT ACACCCAACC CTTCTTGCGC TTAGGTGGTC CTCAGTCCCC CTCCCCCACC 2700 

TCCTTCTGAC CCAGGCTTCT TTCTCGCCCT CCGGTCGCAG TTCTCCTGGG CATCTGCCTC 2760 

TGCCTCTCTC CTCTCACCCG GATCTAGGGC TGCCTTCTCT TTGTGCAGCC GTCTTTCTCC 2 820 

ACCTTCATCC CAGACTCCCT GTCTCAGCGC CAGCTCCTCT GCCTTTGGCT CGGGTTCCCT 2880 

CTCCCCCACC CCAGCTTCCA GTTGTTTGGC CCGCAGGTCC CTCGGCAGTG ACCGGCGCCC 2940 

CCCGACGAGT GCGTGTGCAC CAGGGCACCT CCCTCTCCCC CACCTCTCAG CCCCGCGCCT 3000 

CTCCACCGCC CGCCCCACCG CGCTGTGGGC GGTCCAGGGC GGGGCTGGGA TCCGGGGCGG 3060 

CTCCCGGGGC TCGGGTTGTG GGAGGCGCCC TCTCCCCGGT CTTCCCCTCT CTTCCCCCCG 3120 

CCCTGCCTTC CCTTGCACCC TCCTTCTTCC CTCCGCCCGG GAGCTCTCCC TGGTCCCCGG 3180 

CGCCGCCTCC TTCCCTCCCG GCTCCCCGCT CCCCGCTCCC GTGGCTGCCG CCGCCCCGGG 3240 
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GAAGAAGAGA CAGGGGTGGG GTTTGGGGGA AGCGAGAGAG GAGGGGAGAG ACCCTGGCCA 33 00 

GGCTGGAGCC TGGATTCGAG GGGAGGAGGG ACGGGAGGAG GAGAAAGGTG GAGGAGAAGG 3360 

GAGGGGGGAG CGGGGAGGAG CGGCCGGGCC TGGGGCCTTG AGGCCCGGGG AGAGCCGGGG 3420 

AGCCGGGCCC GCGCGCCGAG GTAAGAGCCA GGGCCCCGGG TTAGCAGGGC TCGGAGAGGG 3480 

GGCGCGCGGC GTGGTGGGGG AGGGGGCAGT GGGCGCAGGG CCCAGCTGGG GGAAGCGGGG 3540 

CTGGGGGAGA GGAGGAACCG CGGGGATGGA ATCGGGGAGC GCTGAGGCGG CCGATGCCGG 3600 

GAGCGTGGGT AAGCCAGGCT TCTGCGAGCC GCGGGGGCCG GGGGAGAGGA GGTGGTGAGA 3660 

GGTGGAGTCC GGGAGGGTTG GGGGCCGAGG GAGGCAGGAG GAGGGTGGGG ACAGGCTTTC 3720 

TCTCCTCCTC TCCCCCCACC CCGCGCGGGG CTCCGCCCCC GCCTCCTCCG CGGGGCGCTC 3780 

TCTTGGTCCC CAGGCTGAGC CCGGTCGGAG CCTGCGAGGC AACCGGCAAG AGGTCGAGTA 3 840 

GTCTCCGGGT GCGGGCCGCG CCGGCGGGGC TCGGTCCAGT CCTCATGGCC GCCTCTCACT 3 900 

TAGATGTTGC TGCTGCTGCT ACTGGCGCCA CTCTTCCTCC GCCCCCCGGG CGCGGGCGGG 3960 

GCGCAGACCC CCAACGCCAC CTCAGAAGGT GCATCCTTCT TCGACGACCT CCGGCCCTCC 402 0 

TTCGCTCCAC TTCCCTTTCC CTGCATCTCC TCATTTCTGG TCCTCATCAC TATCCCATCA 4080 

GTCCCACATA TCATCCCGGT CTGGCAACCC CTTCTGCTCG GCCCGACTTT ACTACTGCTG 4140 

ACCTCCTTCT GTCACCCCAC GTTACTATCC AGCACCTCTT TTCTCTGCCC ACATTGCTAC 4200 

ACTATACCAC CTTCCTGTGC ATTTTCTCCG CCTCAATCCC CTTTCCCAGC CCCACATTAC 4260 

TACCTCAATT ACTCCCTTTT CTTGGTCCCA CTTTGCTGTC CAGATGATCT TATTAGCCTC 4320 

CCTTTATCCT CCTATCCTAA TTCAACTGGA ATATCCTCAT TTAGCCTTTT TTTTTAAAGA 4380 

AAAGCTCCAC CCACATATCA TACCCTTCAT GATTTCTTAA TTACTTTTCT TTCTTACCTC 4440 

CACCCAGCAC CCTTCCCTCC CCACTTGTGG GTTCTCTCAT CAGCTTTAAC CCTGGCCCTT 4500 
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TACTCTCTGT CCTTTAGCCA GGGGATCTGT 
CATCCCTCTT CCTCTGTCCC CAGCCTGCCC 
5 CACTGGGGAG CCTGCCTTTT CCTCTTTCCC 
ACCCCTTAGG TTGCCAGATC ATACACCCGC 
TGACTCGGGA CCAGGTGAAG GCTATCAACT 

to 

TGTGCCGGGG GGAGCGCGAG GTGGTGGGGC 
CCTGGACAGA TATGGACACA CCCAGCCGCT 
15 CTTCAAGACT ATTCCTTTTC CTGCCGCAAA 
TAAATCCAGT ATACCAAAAT TCACAAATAC 
TTTTGCTCTG TGCGGTTGGA GGTAGTAGAG 

20 

CCTTCATTAG GCCACTGATC ATTGGCCTAT 
TCTTTGAGGA TCATTAGTGG CAGATGATGA 
25 TTTTGAATAC CTCTGTCACC AACCCAGAGA 
ATATTAATAG AAGCCAACTA TAATAAGAAA 
TACATCTACA TATTTCAAAG CCAGAGAACC 

30 

ATTTTGTGTG CTTATAGCCC CATCTTGGGT 
GGAATGTGTG GATGCTTGCA CAGGTAAGGG 
35 TCACATGTGA TTTTC TTT AG AGCAGGATGC 
TGCTCTTTAA AGGGAAAGAG CCTTCTTCAA 
CACAGTCCGA ATCTGCTCCA AGTCTTATTT 

40 

GGGTGGGGAC CTCCCAGCTC TGGACGGAGC 
CCATCTGGTG GGCAGCTCCC GGAGCATCTG 



PCT/SE98/01947 

ACCTGTCCCC ACTCCCACCC TCTAGTGCCC 4560 

ACAGACCACG CCCTACTCTC CCCTTCCTCC 4620 

ACCATTCCTC TCTGTATGCC TCCCCGACTC 4680 

CCTGGGAAGG GGGCATCAGG TACCGGGGCC 4740 

TCCTGCCAGT GGACTATGAG ATTGAGTATG 4800 

CCAAGGTCCG CAAGTGCCTG GCCAACGGCT 4860 

GTGGTGAGTA GCCTCGGAAG CCCCTCCCCT 4920 

CTTAGCATTA CTGCTTGCAA GTCAGCACTT 4980 

ATTTATTGAA TGACTACTAC ATAAGAGCAA 5040 

CTAGCAGCCT GCACAGTTCA TTTCATCCTC 5100 

AACATTGATA ATTCATCTTG TCAGTTATTC 5160 

CAAAAAAATT C TAAAATG AT TTCATCACAT 5220 

CCATATGCCC AAGAAACAAA AGCCAGTTTA 5280 

AGCAAATCTG ATTGTGCATC CAAAGTTATA 5340 

GCCCACTGTA GCTGACTTTG AAGAGATCCC 5400 

TCCTAAAATG GTAATTTTTT TTTTCTTTTG 5460 

AGGATTGGAA GATAGGTAGG CAAATCCTTT 5520 
TTGTGGACCC AAACCTGCAC CTGAGTCCCC 5580 
CTCGCCTCTC TTCTTATTTT CCTATCTCTC 5640 
GACCCTGGAA AATGGGAAGG TTTTCCTGAC 5700 
CCGGGTGGAT TTCCGGTGTG ACCCCGACTT 5760 
TAGTCAGGGC CAGTGGAGCA CCCCCAAGCC 5820 
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CCACTGCCAG GGTGAGGGGA ACAGCTGCCT GCATGCAGCT GATGAGGACG CTTGTGTGAG 5880 

GATGGGAGTG GGGTGG GAAT GGATAATGGG AAAGAATGGA GAGCTATAAA AATGTGGGGG 5940 

5 

AGGACACTGG AAAGGGGAGA TGAAAGTCCC TTTTTCCTCC ATCACCTGCC TCAAACTTCC 6000 

TCTTGCAGTC CCCGGTATCC TCTGTAGGTT GGGGGCTTCC TTCCTTTACC TTTTAAAAAA 6060 

10 ATCTTCCTGC TCCCGATTCT TAGACCTCAC GTTTTCTCTT TTCCTTTATG AATCTCACCT 6120 

CTCTCACCTT CTTCAGGTTT AAATACTCCA ATTTTCCCTT TCTCTAAACT TAGAAATTTC 6180 



15 



CATGCATCAC CCTCTTCTAG AATTCATCCC TCACCATTCC TTATATAATT GATTTATTGT 6240 
AAAGACTCAG AAATAAATCA AACATTCTAC TAAGAAAAAT TGAGAAGGGG AGCTCTGGGG 6300 
GTGGAAACAT ATTAGGGTAA AAGACTTAAA ATTGGAGGCA GCATTATCAG AAGATGAAGA 6360 



20 ACAACTCAGG GATGGGGTGG GAAGAAGACA GGTCCTTTTC TGKACTTCCT AGACAACCTC 6420 
CATTATTCCC TAAGGGAATC AGTGTTGTGT CTGTCTACYT TTTTTTTTTT TTTTTTGCCA 6480 



25 



CGTAATTTTA CAAACTCTCC CTTTTCTAGG CACCCGAACT CTCTGCCATC TTCTCTCCTG 6540 
GGATGCAGTC ATCCCATTTG TATGCCTCAT ACTTCCTCTA CCCTGGTAGA TTCTTTCAAG 6600 
ATCCTTGGGC TTTACTTTCC TCACATAACT CAGTTATTCT GCTTCTAGTT TACCATTTTA 6660 



30 



TTCTGGAAAT TGAGAGTCCC ATCCAGGGGT GGACTTATGA CACTACTGAA ACTTAGACTT 6720 



35 



CAAGGTTCCT CACCTACAGG GCCCTCTTCC TGTGCTCTAA TAATATAGAG GGCTCGATGG 6780 

ATATGTGTTC ATATGGTAAC AGGCTTTTGT AAAAATTGCA GAAATAAGAT TTTAACAGCA 6840 

ATTGCTTAAA GCCAATTGTA TGTGTAATTT TTTTTCTTAA AGACTCCCAA TTTTGTAATA 6900 

TTCAGGCACC ACAGAACCAA GATCTGCCCC AAACTTAGCT ATTGGCATTC CCGTCTCAAA 6960 



40 



TTCTGTTGTC CTATGAAAAA TCGAAGAAGA AAATAAGTCC TGACCCCCTT ACCCCCAGAC 7020 » 



CCACCTTGTT CTTATCCCCA GGCACCCTCC CCTCAGAAAC GCAGGCTTCT GCTCTCCCCG 7080 
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GTCTTCAGCA rj^Qp^CAGG'TG TGGGAGGGGG CTGGGGATCA GGCCAGGGAA GCTGGGCGCC 71^0 

AGTGGTAACT CTTCTCTGAT CCCCGTCTTT CCTGCTGCCA GTGAATCGAA CGCCACACTC 7200 

AGGTGAGATG AGAAACCCTT ACCGCGCGCA CTGCAATGCC CTCCCCTTCA CTCTGCACCC 7260 

TCCACCCCCC TGAAATTCTG CCCTTAGGCT ACGGGGCGTC GTCCTTTCGC ACCTTCCCCA 7320 

ACCCACCCCA GTTTGCGGCC ACCCCCTTCC CTCCCTACCT GTTTCCTGCC TCCAGTCCCG 7380 

GTTTTCCACG AGGCTGCGGT CTCTCCTTGT CCCTGCTTGG CTACACTTCC CTGGGCTCCA 7440 

CCTCCTCCCA GACTGAGCCT CGCCGGTGTC AGGCAGAGCC CAGCAGARGG CGGCAGGGTG 7500 

CTGGGAGACC CTGAGCTCCC ACCACGTTTT CCCCTGTGGG GTTCCTTGCG ACCTTCGCTG 7560 

GAACCTTTTC CAGCCTGCTG CCTCCTAGGA TTTCACCTAA TGGACTTTCT CAGCCTGTCC 7620 

CACCCATCCC AACCCTGGCC AGGCCTCTCG CGCTCTTCCC CACATCTTTT CCTTCCGTGT 7680 

ACCCCTTCCC TCGTCTTTTC TCAATTCCAT GTCCTGTCTC CCTTTCTTAG GCTTCTGTCT 7740 

ACCCAGCCCC AGGCTCCCTT CCACGACCCC ACCACTCCCT CAAACCAGCC TCCCTTCCGT 7800 

ACCCAACTCG TTCCCTCCAA AACCGTTTCC TCTCCCCCAC ATCCTCAGTG CTTCACTGTA 7860 

TCGACTCATA CTCCCACTTC AGACCTCAGG CGCCAGCCCC GTTTCTCTCC CGTCCCACTC 7920 

GCATCCTTCC CTTCCTACCC TGGTTCCTCC GTGCTTCAGC CTCCCGCGGC TCCCTCCGCC 7980 

CACCCCGCCC TCCTGGCACG CCCCGTCCCC ATTTCTCCTC CCCTCGGGTC CCCTTAAGTG 8040 

AGATCCCTCC CTTCCTCTTT CGTTCCTTTC CTCCTCGAGG TTGCATCCCC CCTCCCCTCC 8100 

CCGCCCCTCC GACTGTCGCT CCCACCTCGG CGCTCGCTTC CCTCCCCGCC CCCTTCCTGC 8160 

CTCCCCAGCT CCCGCCCGCC CCCCCACCCC CCGCTGCCGC GCGCCGCCCG TGACGTCAGA 8220 

GCCCCCTCCC AGCCCCACAT CTCCCTCCTG CTCCTCCTCC TCCCCTCCGT CGGTCAGTCA 8280 

GTCCGCGAGG AGAGTCCGCG GTGGCGGCGA CGGTGGCGAG AGCCGCGGGG GCCGTAGGAA 8340 

GCCAACCTTC CCTGCTTCTC CGGGGCCCTC GCCCCCTCCT CCCCACAAAA TCAGGGATGG 8400 
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AGGCGCCTCC CCGGCACCCT CTTAGCAGCC CTCCCCGGGA AAAGTGTCCC CCCTGAGCTC 8460 

CTAACGCTCC CCAACAGCTA CCCCTGCCCC CCACGCCATG GGGCCCGGGG CCCCTTTTGC 8520 

CCGGGTGGGG TGGCCACTGC CGCTTCTGGT TGTGATGGCG GCAGGGGTGG CTCCGGTGTG 8580 

GGCCTCCCAC TCCCCCCATC TCCCGCGGCC TCACTCGCGG GTCCCCCCGC ACCCCTCCTC 8640 

AGAACGGCGC GCAGTGTACA TCGGGGCACT GTTTCCCATG AGCGGGGGCT GGCCAGGGGG 8700 

CCAGGCCTGC CAGCCCGCGG TGGAGATGGC GCTGGAGGAC GTGAATAGCC GCAGGGACAT 8760 

CCTGCCGGAC TATGAGCTCA AGCTCATCCA CCACGACAGC AAGGTAGCCC TGGACATGGG 8820 

GGTGGGTGGG AGGTGGGGGC TTGCGGGSCA GGGGGCCAAG CAAGCCTGCA CGCGCCCCCA 8880 

TCTGTCTGAG TCGTCTCTGG GATTGCGAGG CAGACCCCTC CCTTGTGTGA CTGGCAGGAG 8940 

ATGGGCTGGG GGTGCAGGAG CTTGGGAAGA GTCGCAGGGG CTGGAGGTCC AAGATGAGGG 9000 

TCTAGGGGCT CAAGATGGTT AAGCATGCTG CAAGGCAGAC CCTTCTGCCC CGCTGCGGGA 9060 

GTCTCGCAGA AGTGTCGGGG TTTGGAGAAA CTGGTGGTGG ATTTAAGGTA TTAGGAGACA 9120 

CTGATCCTCT GAGGGAGTAA ACTAACCCTG GAATGGGTTG GGGGTGGAGG GAATGTCAGA 9180 

GGTGGGGAGC TGGATTGGGG GGTTACATTT ACCATGGTAA CAAGGTAAAA TCTTGGCGTA 9240 

GGTTGGAGCT GGAAGGAATA GGGACAGAAT GAGGAAAATT TTGAGAGACT TGAGAGCTCT 9300 

AGTTTATTTA TCTTAACAAA ACAGCAAGGT AGTGGTGAGC CCTACCTGAC TCCTTCTCAT 93 60 

CCTTCTATTC CCAACCCTGT TGAGCATTCC CAGACTGTGG GATAGATGGC ATATGGTGAT 9420 

TGGGGAAGGC TAATGATCAA GAGGTGGGCA GAGGCACTGG GAAAATGAAT TGGATTGGGG 9480 

ATCCACATGG GAACCCCCAC AATAGCATGG GGATGAAGAA GAGTCAACAT ACAAGGAGAA 9540 

GAGAACAGAA AAGAATGGCA GTGGGGGAGA GGGGCAAGGA GGTAGCGTGG GGATAATGAG 9600 1; 

AGATCTTGGG GCACCTTATG GAACTTGGGT CCTGACCTTC CCTTCCCTTA TAGCATTGTG 9660 
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GCCTCTAGGA TGTGAGAAGG GAAATGGGAT GTAGGGATTA GGG AGGTGAG TTGAGGGAGA 9720 

GAGAGAAGGT AAGCAAATTT GGGTCCAGGG GTATTAGGGG ATAGCTTATA ATGAGGTTTT 9780 

TTTTCCCACC CCTCTCCCCT ACATGAATAA TTGGGGGTGC AGGGAAGGAT GTGACACAGG 9840 

GAAGGAGATT TAAGATCTCA AATTTATC TT CACTGACATG TGGCCCCAGA GACTTAAGGA 9900 

ATTGGGTTAG GGTGAAATAG AGTACACAAG GTGAGAATTT GGTGATCTTA CCAAATATCA 9960 

ACCTTGGGGT GATCCAAGGA TTTATATTCA TTCTTAGAAC ATCACTATAC ACCTAGAAAT 10020 

AGGTGTGTGT CTGGGATAGG TGTGTGAGGG GACAGAAGTG AGGTTGAAGG TAGGGTGCTT 10080 

GAAGAGAAGA GAGCACAAGG ATTATCAGGA GCTTGGCAAG AGAACTTAAA ATCCTTTTTG 10140 

ACTGTTACTT TCTCGTGGTT CTCAGCCTTC AGTGTACATA AGAATCACCA GAGGAGTTTG 10200 

TTAAAAATAC AGATTCTAGC TCCTTGGTCA GGGATGAATC CCAAGTATTT ATCTGTATTT 10260 

TTACTAATAG ACATCCTATC TTGGTGGATT CCTGAGCTGT AAGCTAACCC CAGAATGCCT 10320 

ATGGGAAGAG CAGCAGGGTA CAGGAAAATA ATTAGGTATT AGGGTACGGG AGGCAGGAAG 10380 

AGAAGTAGAG GATCAGATCT GGTAGAGGGT CAGACTTGGG ACAGTCAGAG AGATCATTGG 10440 

TTTTGGGGAG TGGAGTGTGA AGAAAATGAC AGGGAGAGAT GGGTGCAGGC TTTATGATAG 10500 

GGGATCACAG GAGATAGGGG AGGCCTGGCT GTGAGCTCAA ACTCATCCAC CATGACAGGT 10560 

GATTCCCTGG AGGTGGCGGG GAGCAGACGT GGGACCTGGG AGAAGGGAAC TGGAGAACAT 10620 

CAGAGGCATC AAGCGGGGTG GGATGGGAAG GCAGAAGAAC CAGAATGTGT CAATTGGAAT 10680 

GAGTCGGTTT CCTGCCTGCA AATCCAGATC CTTGCAAGAG CAAAGAGAGG GAGGAGAACT 10740 

AAGGAAATCT ATTGGGGAGG GGGAGAGAAT CACGTGGTGG AGAGAATCTG CAGTGATGAA 10800 

TAGTGTGTGG AAG AGGG AAA CGGTTGCAAG AAAAGGTAGA TAAGAAATCA GGAAACAAAA 10860 

TGGGGGGCAT GCCTTGCCCT GTTGATATGT ATCTTATATG TTCTTGAATG TCCTCATTGK 10920 

TCCTATTAAC CCTGTCTTTA GAGAAGTGGA GGGGCACTGA GGGGCTGTGG GAGAAGCTGG 10980 
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GAGCAGGATC TGGAGTAATA GATGTGGGGA GAGTGCAGGA AGGTGGGTCC TGAGAATGGT 11040 

AAAGATTTAC AAAGTTGCCC TAGTGGGAGG CATAAAGAGA AAACCTTCCA ATGTTGTTGA 11100 

5 

GCACTGCCCT TGGCCAGAGT GAGGGTAGGG TGGGCAACAG AGAATTCTCA GTGACTGCTG 11160 

GTTCTTCAGA TTCCAACAGC TTCCCCTGGC TCCCCCTTCT CCAACTTCCC ACCGTGTCCC 11220 

10 AAATGTCAGG CCTCAGTGGG AGGTAAGCAG GCTCCAGAGT GCTTTCTTTA TTTCCTTTCT 11280 

ACTTATCCTC CCCTCCTGGC AACATTTCAC CCTCCTTAGT CCCCTGAGCC CCCTGTCTGT 11340 

GTCCCCTCTG CCCTGGCTCC CCACTGGCTG CCATTTCGTC TTCACATGCA TTGGGGTTCC 11400 

15 

AGCAGCTTCT GAAATGTCAT ATATCAGTGG GAGGGGAACA GGCAGTGGGA GACCCAAGGC 11460 

TGGCTCTTCC TCCCCCATTT CCCCTCCTCC CAAGCTTCCT TTCTTCTCCA GCTTTCTGCT 11520 

20 TGTTTACTTT CCCTAGCTCC AAGCCTCTCT TTAAGGCACC TCTCAAATTG KCTGGTTTCT 11580 

TGAGAGTTCC ATTCTATTCA TTCTCTCTGT TCTTTCCTCA TCCTACATTC TTCCCTACTT 11640 

CCACCCCCCA GTGTCTTTTT TTCTAATGGA CCTGTCAAAT GTCAGCGCCC AGCAGGAGGG 11700 

25 

ATGGATCACT GAGCGGGACC CCCTACTGGT CTTGTTCCTG TTCTCTCTTT ACTTATCACT 11760 

AGCTCTGAAA AGAGAAGAGG GAGGAAACAA ATGGAAGGTG GGGAGAAGGG GTTTGCAGAG 11820 

30 GTGAGGAAGG AATTTTCATA ATATGGCTTT GAGCAAGCTA TCTGGGGATG TGGAAAGAGT 11880 

TTACCGTATT CCTACTGACT TCTTCCACCC ACTGGTGTTT GAAGCATAGA AACATGGGGT 11940 
AAAGGGCTTG GTGACAGAGG GAAGGGGGAT GTCTGAGGGT GAGCTGAAAG GAGGTAAGGT 12000 

35 

GGTATGTTCA TTAATACCAA AGGAGGGGTG TGCAGGAGAG GTGATGGGTA AGGCTCCAGA 12060 
TGGAAGACAG AGAAGGAAGT TTAATGAAAG AGGAGAAAAA AGGCACTTGA CAGGAAGAGA 12120 
40 TGCCAGAAAG GAGAAGAAAA CGGTAATTAA TGATGAAAGT GAGTAATTGA GAAAGGAACT 12180:, 
AATTTGTTCG AGAAAGATAA GAGCAGGAAT TGCAGACAGG GGAGGGGCCC CAGGAGAGCT 12240 
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TGCCCTCATC TCCTCTTGTC TTTCAGTGTG ATCCAGGCCA AGCCACCAAG TACCTATATG 12300 

AGCTGCTCTA CAACGACCCT ATCAAGATCA TCCTTATGCC TGGCTGCAGC TCTGTCTCCA 12360 

5 CGCTGGTGGC TGAGGCTGCT AGGATGTGGA ACCTCATTGT GGTAAGCAGG GCTATGGGGG 12420 

TCAGAAGATG GGGTCATTCC CTTTTGAGCT CTACTGAAGG GACGATGGCG ATTGTGGGTT 12480 

TGTATTGAAA AGGAGTGTGG AGGACCTGCT ACTAAGATTC AGAGTCCTCT GCAGACCTGA 12540 

GCTAGGCAGC CTCCTAGCAA CAGTGGCCTG ACAGTGCTGC AGCTGACCTC CTTCTTCAGA 12600 

AGGAATTGAA ATTAGATCAG TGAAAGAGCA TCCCGGTTGT GAGGGGTGTG TGGGCCTTTG 12660 

5 AGAATCTCTT TTCCTTAGGC AGACCAGAGG TGGGGAGGTT TGGAGAGAGT AAGGAAGAGA 12720 

AACCCAAAGG CAGGAAGAGG GTTAAAGGAA CTCTTGGCCA CTCTTGGTGT CCTCAGTGAA 1278 0 

CAGACCCTGT TGCACTCACT CTCCCTGCCC CACAGCTTTC CTATGGCTCC AGCTCACCAG 12840 

CCCTGTCAAA CCGGCAGCGT TTCCCCACTT TCTTCCGAAC GCACCCATCA GCCACACTCC 12900 

ACAACCCTAC CCGCGTGAAA CTCTTTGAAA AGTGGGGCTG GAAGAAGATT GCTACCATCC 12960 

25 AGCAGACCAC TGAGGTCTTC ACTTCGGTGA GGAGGGGTTG GGCAAGGGGT AAAGGGACAT . 13 020 

AAGCTCAAAT TCCAGCACCA GGAGATGTGA CGTGAGAGTC ACTTTTAGGG GCAAGAACTT 13 080 

GATTCTTCAT TGAAAGAGAA CGCATTCCAT GTGGATTAAG TGCAGTTCTT TCTGTAGCCA 1314 0 

30 

GGGGAAAGAA TGAGTTGAGT TTTTGGGATC CTCTCTGTCT TTATGATTTT ATGATTTTTT 13200 

TCCCCTGTTT GATGCCCTGT TCCCCAGACA TATAGACCCA GAATGACTCA GTTCTGTTAA 13260 

35 AGTAGGTTCA ATCCAAAGTG GGGGCAAGAG ATGGGAGCGA AGATGAGATA GGAATCCAGG 13320 

AAGGCAGCAG ATTCCAGAGG CTTTCAAGGG GGGTGGTGGG TGGGTGTGAA TGGGAACAGA 133 80 

GGGGATGGAG CCAGTGGATT ACAGAGGAGA GAGGGAGAGG AAAGAGAGAG AGAGAGAGGA 13440 

40 

ATGAGGGAGA GGAGAGAGAG GGGCAGAAAG GCAGCTGCAT GGATCTGGTA GTTGGTACTA 13500 

AGAGAGAGAA GCCGACAGAC AAGGAGAGGT TGAGGGGGAA GAGGGAGATT TGGGGAGGTA 13560 
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GAGAGGAAAT ACAGGCTCTA CATCTGAAGA AGGCAGTCTG CTCCCTCCCT TTTATTCTAT 13 620 

TCTTTGGGTC TTCTATCCAC TGTGTTCAGT GGCCCTTTAA TCCTCCCCCA CTTTCACTCT 13680 

5 

GATTCAGACC ATTCTTCTCT GATCCTTTGT CTGTCTGCCC ATTTGCCTCT TGAGGTTGAC 13740 

ATCATGCTGT CTGTCCCAGT CCTTGCCTTG TCTTTTCCTG GTTCCTTTAT GTTTCTTTAC 13 800 

10 CCCATCTTTG CCTTCAGTGG TAGGAGTGGG TGAATGGAGT GGCTTCCCCC ACACAGAGCC 13 860 

TCAGCAGGGG CTCACCATTC ACCTTCCCAC TTGGAATCCA CATCC TAAGA CCAGATGCCT 13 92 0 

TCCCGAACTC CTCACTTCAG GGACAGAAGC TGTTGAAGGA AGGTTCAGAA TGGCTGCTTC 13980 

TTTGCTCTAT CTGAGTATTG CTCTGAAATC CCCAGTTAAC CTCTCTGGTC TTTATTCCCT 14040 

CATGCACCCC GTGTTTTTCC AACTTGTTTT TTATTCCCAC CCAAGACTCT GGACGACCTG 14100 

GAGGAACGAG TGAAGGAGGC TGGAATTGAG ATTACTTTCC GCCAGAGTTT CTTCTCAGAT 14160 

CCAGCTGTGC CCGTCAAAAA CCTGAAGGTC AGATGGCTGG GAGTGGTGGG CTCTGTTTAC 14220 

GGAGGGACCA AGCTGGGGGA CAGTGACTGG TTGGAGAGGA AAGCCAGGCG GGGGCAGGTT 14280 

TTGATTCTCT GAGGCAATAG CATCTCCTGG GGAAGTTTAG CTCCATCTTC CAGTTGACGT 14340 

TTATTCACTA TAC GTTGAGC GTTACCCTGC ACTAAGCACT TTGGGATGGG AAATCAAAGC 14400 

30 TGTGAAGACA TCTGGCTTAG CCCCTCAGGC ATTCCCGGGC ATCCCTCAGG AGCTGTTTCT 14460 

TTCTCTGTTG TAGCGCCAGG ATGCCCGAAT CATCGTGGGA CTTTTCTATG AGACTGAAGC 14520 

CCGGAAAGTT TTTTGTGAGG TGGAGTTGGA TCTGAAGAGG GAGGGGCACT GGGTGGGAGT 14580 

35 

TTCCCTTGGT TTTCTTGTGG GGCCTCCTCT TGGCATCTGT GCCTGAGTTG ATAGCATATG 14640 

ATCTGAGGTG ACGATTCATA GGATGTCTCT GTCTGTTGGC TCTGACTGCA TCCCTTGTCT 14700 

40 GCACACACAT GATACTTTCT TCAGATCTCA TTTTTCTACT GCTTTGTGTT TCCTGAGAAG 14760, 

CCCATGAATT CCATCTGTCC TGACTGGCTG GAAAAGGCCA CTCAGAAATA CAGGGGCTGG 14820 
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GGAGAAACTT AGAAGGAAGA ATTGTCAGCC TTTCCTACTA TCCCCAAGAC TTGTAGATTT 148^0 

CTCTTTTTAG TTCTACTGCT CTTCCCTGAT TCCCAAGAGG CTAAATAGTA TCAAGTGAGA 14940 

5 TAAGACAAAA ACAAACAAAT GAGCAAACAA AAACTCAGCC ATTCTCCTCT GTATTCAGGT 15000 

GTACAAGGAG CGTCTCTTTG GGAAGAAGTA CGTCTGGTTC CTCATTGGGT GGTATGCTGA 15060 

CAATTGGTTC AAGATCTACG ACCCTTCTAT CAACTGCACA GTGGATGAGA TGACTGAGGC 15120 

GGTGGAGGGC CACATCACAA CTGAGATTGT CATGCTGAAT CCTGCCAATA CCCGCAGCAT 15180 

TTCCAACATG GTGAGAGTGT GGGGACTTGC AGTCTGGCAC CTGGGAGGGT GGAGAGGACT 15240 

15 GAGGGGGCCT TGCAGGGGAA AGGGTGGCAG GGAGAGGGTG CGGAATTTGG ATATAAAGGA 15300 

GAAGAGGGGG CTGTGCCCAC CCTGAACTTG TCTGCATTAT GTTTCCTGTG GATCCTACCT 15360 

TTGCTCTGAC TTCCTTGGGT AGAGAGAGAA AAAAAAAAAA ACGATGGAGT TGTATGTTCA 15420 

20 

GTAGGTTCCT GATGAGTGGA AGGGCTGTTA CCATGGAGAC GAGGAGCAGT TGGTGAGAAG 15480 

TCAGGAGGAA CCGGCATTAA TGATAATATG GATGCTTGTA TACTCAAGCA CACCTTTACA 15540 

25 GGAGCACTGT GTCTGGGCAG AATTGCATTT CATTTTCTTG GTGATTTATG TTAGTGTTTT 15600 

AGAGTTGCTT AATATTCACT CATGATTGAT ATGCAATTAG CTTGGATCCT GTTGCTTTTT 15660 

TTAATGATTC TTTTTTTAGA ATTTTATGTG GAGAAGGGGC TTTTGAAATC ATTTAGCCCC 15720 

30 

AGACAGCTGG TTAATGACAG ACTTGGAGCG AGGAGGCGCA TTCTCTGATT TGAGAGGGCT 15780 

GGGGCAGCCC TGCAGCTGAG GGGCAGAAGT GAGTGTGGGT GTGGGCACAG GGGAAGAGGC 15840 

35 TTCTTTGTTG CTCAGCACTA CATTGTAAAG AGTACAGGAA TTTTGGGTAA GCTCGATACG 15900 

GCCACCTATG TGTCTGCTTA TTACACCAAG TGTGTACCTG CTTTGGAGAG CATGTTAGGG 15960 

GTGAGTTAAT GGTTGTAAAG CCCCAAAATA GCTTGATTTA AAAAGTTTTA CTAACAGTGG 16020 

40 

CATTTGTGTG GTACTTAGAT TCATGATTTC AGGGATTCTA ACAGCAAAAC TCTGATGTGG 16080 

ACTTATCTAC ATTTTGTAGA CCAGGAAAAT GAGACTCAGA GTGGTTTAAT GACTTGTCTC 16140 
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AGGTCATGCA GCTACTCAGC AGTCCAGCTG GAATTGAATC CTAGTTAAAT CTGGCTCCAA 16200 

ACTGCTCTGC TCATCACTTC CCATGGAAAA CCATGAAGCC TGACTAAAAA GTTTGTACTT 16260 

TTATCTGCTA ACACTGTTTA GCTAC TGGCA ATTGCTGAGC AAAGGAGAAA TAAGATTTAG 16320 

GAAGTTGACT CTGGTGGTGT GTTGTGATGA AAGATCTGCT GCCACATGGA CTCTGATGAT 16380 

ACACACTGTT TTGGGACATT CTAAATGGAC TCTATGTTTT AAGGTATGTT AGTTGAAAAA 16440 

TACAATGAAA AAATTTTGGA AGATCACTTT CCTGTAGAGT ACCCAAGAGG TGAGTCATCC 16500 

ATAAAACCTG GGAAAGTATT GGAAC TGGCT ACCAAAGCCA GAAAGACTTG TTGGTGCCCT 16560 

GAGCCCAGGA CTTGCCAATC GTTGCTTCTG TCTTATAGCA GGGAGCTAGA GGAGGCTGAC 16620 

AGAAAACAGG GATTTGGCTG GATCTAGTGG CTCACACCTG TAGTCCCCAG CACCTTGGGA 16680 

GGCCAAGGCA GAGGATCACT GGAGCCCAGG AGTTCAAGAC CAGCCTGGGC AATATAGTGA 16740 

GACCCCCAGC TCTAAACAAA AAGAGAGAGA GAGAGAACAG AGAGATGGTT AGGGACATAG 16800 

AGGGACACCT GAGCATTGGG GTGGGAAGAC TTGGAAGGTC CCTTTC TGAC TTTGTGTCTA 16860 



GA 



(2) INFORMATION FOR SEQ ID NO: 61: 

( i ) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 16707 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 
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(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE : 

(A) NAME/KEY: intron 

(B) LOCATION : 1 . .2742 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION: 2743 . . 2985 

(ix) FEATURE: 

(A) NAME/ KEY: intron 

(B) LOCATION: 2986 . .3613 

(ix) FEATURE: 

(A) NAME /KEY: exon 

(B) LOCATION : 3614 . .3 677 

(ix) FEATURE: 

(A) NAME /KEY: intron 

(B) LOCATION: 3 678 . .5226 

(ix) FEATURE: 

(A) NAME / KEY : exon 

(B) LOCATION: 5227 . .5304 

(ix) FEATURE: 

( A ) NAME / KEY : in t r on 

(B) LOCATION: 53 05. .6 848 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 6849 . .6999 

(ix) FEATURE: 

(A) NAME / KEY : intron 

(B) LOCATION: 7000 . .7494 

(ix) FEATURE: 

( A ) NAME / KEY : exon 

(B) LOCATION: 7495 . .7 627 
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(B) LOCATION: 11370 . .11591 

<ix) FEATURE : 

(A) NAME /KEY : exon 

(B) LOCATION: 11592 . .11735 

(ix) FEATURE: 

(A) NAME /KEY : intron 

(B) LOCATION: 11736 . .12512 

(ix) FEATURE: 

(A) NAME /KEY : exon 

(B) LOCATION: 12513 . .13 999 

15 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61: 
AGCTAATTTT GGAATTTTTT TTTTTGAGAC AGAGTTTCAT TCTTATTGCC CAGGCTTGAG 60 

20 TGCAATGGCG CGATCTTGGC TCACAGCAAC CTCCGCCTCC CAGGTTCAAG CCATTCTCCT 120 

GCCTCAGCCT CCGGAGTAGC TGGGATTACA GGCATGCGCC ACCACACCCG GCTAATTTTG 180 

TATTTTT AG T AGAGACAGGG TTTCTCCATG TTGCTCAGGC TGGTCTCGAA CTCCGGACCT 240 

CAGGTGATCC ACCCGCCTCA GCCTCCCAAA GTGCTGGGAT TACAGGCGTG AGCCACCGTG 3 00 

CCCAGC TAAT TTTTGTATTT TTCATAGAGA CAGGGTTTCA CCATGTTGGC CAGGCTGGTC 3 60 

TTGAACTCCT GACCTCATGA TCTGCCCGCC TTGACCTCCC AAAGTCCTGA GATTACAGAT 420 

GTGAGCCTCC GTGCCCAGGC CAAGTCTGGC TAATTTTTAA AAAAATTTTG TAGAGTTGGG 480 

TCCTCTCTGT TTTGCCCAGT CTTGTCTCAA ACTCCTGGGC TCAAGGAATC CTCCTGCGTT 540 

GGCCTCTGAA AATGAAAATG TTGGGATTAC AGGCGTGAGC CCCCTGTGCC TGGCTGCCTT 600 

TTTTTTTTTT TTTTTTTAAA GGCAGAGTCT CACTCCATCG CCCAGGCTGA AGTGCAGTGG 660 

CGTGATCTCT GCTCACTGCA ACCTCTGCCT CTTGAGTTCA AGCGATTCTT CTACCTCAGC 720 , 

TTCCCAAGTA GCTGGGATTA CGGGAGCCCA CCAACACACC CAGCTAATTT TTGTATTTTT 780 
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30 



35 
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AGTAGAGACG GGTTTCACCA GGTTGGCCAG 
CACCCTCCTT GGCCTCCCAA AGAGCTGGGA 
5 ACTTGTTTTT TATAATGCCT TTTTTTTTTT 
AGGCTGGAGT GTAGTGGCGT CATCTCAGCT 
GATTTTCTCA CCTCAGCCCT CAGCCTCCTG 

10 

TGCCCAGCTA ATTTTTTGTA TTTTAGTAGA 
CTTTAACTCC TGAGCTCAGG CAGTCTGCTT 
15 GGTGTGAGCC ACTGTGCCTG GCCTTTTTTT 
TCCCAGCAGA ATTGAGATGA AGGTACAGAA 
TTCTACATTA TCGACATCCT CCGCCAGAGT 

20 

TTGATACATC ATAATCACCC AAAGTCCATA 
ATATTCTATG GGTTTGGACA AATGTATAAT 
25 AGTATTTTCA CTGGCCTAAT CCAATGGACA 
GTGCTAGATG CTGATGATTA AGAGAGGGCA 
CCTAGGCCCA TTGCATCTTC AAAGCCCAGG 

30 

CTGTACCTTG CCTGTGCCCT TTGCGTATCT 
GAAAAGTGAG CAAGGCATGA GGAACAAAGT 
35 TCTTTCATTA CCGTTTGTGT TAAAACATTC 
TGTGTATTAC TTGTGGTTTC CCAAGAAGTA 
TGAGGTAAGG AAGGAATGAT GTGATTTTCT 

40 

AAGTAATTTA CTTACAGTTA TTTAGCCAAA 
TCTCTTGACC CAGTTCATGA GAGAGTTAAA 



PCT/SE98/01947 

GCTGTCTGGA ATTCCTGACC TCAGGTGATC 8"$t) 

TTACAAGTGT GAGCCACTGT GCCCAGCCTG 900 

TTTTTGAGAC GGAGTCTTGC TCTGTCGCCC 960 

CACTGTAACC TCCACCTCCT GGGTTGAAGT 1020 

AGTAGTTGGG ACTGCAAGTG CACACCACCA 1080 

GATGGGGTTT CACCATGTTG CCCAGCTGGT 1140 

ACCTTGGCCT CCCAAAGTGC TAGGATTAAA 1200 

TTTTTTTTTT TTTTTTGAGC AGTTTTAGTT 1260 

ACTTCCCATA TGCTTCCCAC ATGCATAGCC 1320 

GGTACATTTG TTACAACTGA TGAACCTACA 1380 

GTTTACATTA GAGTTCACCC TTGGTGTTAT 1440 

GAGACGTATC TACTATTAAA TACTTTACAG 1500 

TTTATTGTTA CTTCATTATG GTTGGGCACA 1560 

TGGGATTTGG TCTTGTCCTC AAGGGTAGAA 1620 

CTCCTTCAAA GCCCAGTGTA GTAGCAACTG 1680 

CACTCCTCTA TCTCTCTAGA AAGTTGGAGA 1740 

TATTTATTTA TTCTTCATTC ATC TATTTAT 1800 

CAAACCCAAA CAATTATTTG TATGGTCCCC 1860 

GTTGCTAAGC TTTTCCTTGT ATGGTTTCTG 1920 

CCAGTATGTA GAATGCAGTT CCAAGAGGTT 1980 

CAAGGTTACT GCAAGGTATA TGAAGTCAGG 2040 

GGAACTATCA TTCTTTTTAG CTTTCATGGA 2100 
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AAAAGAAGGT TGAGTGTTGG GAGGGGTGTG GGTAGGATTG ATAATGGACT TCAAAAATGT 2160 

GAAGGGTATT TCTGTAGTTT TCATTCTTCT GAAAGCCTTC TAAGAGGCAG TGAACCAAAA 2220 

5 

GCACACAAGA ATGGCAAGAA GTTAGCATGC TGAAGAAATA TCCTCCTGGC TGGCAAGCAG 2280 

AGTGAGAAGA CTGCTATCAC CTTTTC TAGA ATCTTTTGGA ATTGTAGGAG CTGTTAGATC 2340 

10 CTGGGTTAAC TCTATGAAGA AAGTCAGAAG GATCAGAGAA CATCAGTGTC ACAGCTCTTC 2400 

ATTGGAATAT CCATGTCTCC TCCTTTACTC TGCTCTACCT TCCATCCTTT GCCACTAATT 2460 

ATCCAGAGTG TTTGTCAAAA TTCTCTGTTT GCAGTTCTGA GCTAGCAACT GTACACACTA 2520 

15 

ACACCATCAG ACACAGCTAA TACCTACTCT AGTCTAGTAG CTTCCGATCT AAGGCAGACA 2580 

CATGGGTATA GTTAAAGATT TTGAATGTAC ATGTGTC CAA TCTGACAACA GTAACACAAA 2640 

20 CCATCCATTC AAGTAGAAGT GATTGAGTCA GAATTGGATT GCACCCCTTC CCCCACACCC 2700 

ACACACATTT CAGTTCTTTC CTCATGATTT TTTCCTCCCA AGACATCCCA GGAATTTGTG 2760 

GAGAAACTAA CCAAGCGACT GAAAAGACAC CCTGAGGAGA CAGGAGGCTT C C AGGAGGC A 2820 

25 

CCGCTGGCCT ATGATGCCAT CTGGGCCTTG GCACTGGCCC TGAACAAGAC ATCTGGAGGA 2880 

GGCGGCCGTT CTGGTGTGCG CCTGGAGGAC TTCAACTACA ACAACCAGAC CATTACCGAC 2940 

30 CAAATCTACC GGGCAATGAA CTCTTCGTCC TTTGAGGGTG TCTCTGTGAG TTAAAACTTC 3000 

CTTCATACTC CCCTGTCTTC CCAATCTTGA GAGAGAC TCC CAAGAGGCAC CTTCTACAAA 3060 

CATGCATTCT CTGTTTTTCT CAGTTACTTC TTTGCAGAAT CAGTCTCCGA CCAGAGAAGT 3120 

35 

AGGGACCTTC AAATTAGAAG AACCCATCAA AGACTAGAGG AAAAAAAATG ATGTATTCCA 3180 

TTTTTTTAAA CCCCTCCCCT CATTTCTTTT CAAACTAGAC CAAGTATTCA TGAGTCAGAT 3240 

40 GAGAACTATA GGATTTTGAA AGACAAAACA GTCTGAAAGG TCATCTTCTT ATTCCTTTTA 3300 

AAATGAAAAG ATTAGTTTCC AGAGAGATTT GCTGACTTGC TTAGGCCACA CAACCAGAAG 33 60 
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CCTGCTGGTG TTCTGTCTGG GGATTTTTTC CCATTCAAAT CTCATAAGTG AAGCTCCTTC 3420 

TCCAAAGAAT AATGTTTCTA AAATCTAGGG TATGGGCATC TGGGGTATGT CCTATATGCA 3480 

5 GGCAAATGCC ATAAATAGCA TTCATTCAGA GGCTCAATTA CATCAAAAAC AGAAGGATTT 3540 

AAAGAGTCCC TGATGTTCTC TTTCACTCTT GCTTTTGTCT CCTTTGCCTT GCTCCACATG 3 600 

TTCCTTCCCT CAGGGCCATG TGGTGTTTGA TGCCAGCGGC TCTCGGATGG CATGGAC GCT 3 660 

TATCGAGCAG CTTCAGGGTT AGTACAGGGG CAGGAGGGGA CCGGACATGG GGGCTAGGCT 3720 

GGGGCTGGGC TGGGATGCCC CCTGGGGAAG AATGCCAGAG ACATCACAAG ATTGCCCTGG 37 80 

15 CACCTCCCAA CTTCTGCCCT TCTCTTTTAA CTCTGTTCAT CAAGCTTGTA AATAATAATA 3840 

ATAATAAGCT TAACTACAAG AAGATTGATG TCTTTGAGTT GCAC TGGTTT TGCTCTTGAA 3900 

AAGAGGTGTG CAGGCTGGGT GTGGTGGCTC ACCCCTGTAA TCCCAGCACT TTTGGGAGGC 3960 

20 

CAAGGCAGGC AGATCATGAT CATGGTCAGG AGTTTGAGAC CAGCCAGACC AACATGGTGA 4020 

AACCTGTCTC TACCAAAAAT ACAAAAAAAA AAAAAAAAAT TAGCTGGGTG TGGTGGCAGG 4080 

25 TGCCTGTAGT CCCAGCTACT TGGGAGGCTG AGGCAGGAGA ATCACTTGAA CCCAGGAGGC 4140 

AGAGGTTGCA GTGAGCTGAG ATCACGCCAC TGCACTCCAG CCTGGGTGAT AGAGTAAGAC 4200 

TCTGTCTCAA AGAAAAAAGA AAAGAAAAGA GACATGCAAA TTAAAAACAG CTACTCTCTT 4260 

30 

TCCCAGTGGC TTCCATTAAT TTCAGGAATT TCCCCTTGAG TGGCTTGGGT TGAGAGGTTG 4320 

ATGACCTGTC AGTTAGACTC AAGAAAGCTG AATCTAGGAG AACCGCTATT TTTTTTTTAA 4380 

35 GGGAATCTGC CAAATTTCCT TGCTGTGTAA AGCTTCAATG TGTATAGCTT GGCTTTTGTA 4440 

GATTGTATTT TCTTGAAACT TAGCACACAG GTATTTGCAG AACTTCTAGG AGTTAATTTT 4500 

TCTGCTCCAC TCGGCTCTCA GTCTTTTACG GCATGGCCAA GAGAGCTATT TCTTGGC CTC 4560 

u 

CTGTGAAAAG TTTCTTTCTT CCTTTCTCCC CACCTCCACA TCCTTTCAGC TCCTCTTTGT 4620 

ATCCAGGACA AGAGGAAATG GACTTCAGCC ATGGTGAAAG GAGTGTGAGT TGGCTTTTGA 4680 
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AGGAAAAGTT ATGGTAACGG AAACAGTTCT AGAACAGAAA TCTTAGAAAT GACCAAATTT 4740 

TACTCAATGG CGCTTTAAGA GGCAGATATA ACTTATCCAA GGAATTAAAA CCCAAGCCAA 4800 

CAGAAGAGAA TGTTCTAAAA TTAAAATGAA AGCCACTGGG AAAATAGAGC CTGCCCATCA 4860 

TGAGAGGAAG AATAAGCAGA AATATGTGTA AAGCTTTAGA AGCCAAAATC AAAGTGAGAG 492 0 

ACATCTCGCC GAGAGAGGTG TGAGGAATGG AATAGGTGGC AGACATGTTG TGGAGCCTCC 4980 

TCACTGAAGA CTTTTAAACA TAGATATTCT TATTTATTTG AGTTGTC TTG GGAACCACCT 5040 

TATATTGCTT TTAAGTCATG TTGCTGATTC AAGAGTCTCG TAGGTCCTTC CAAGCATCCT 5100 

TAGGGCCTCA GGTGAAAATA AAATCAGATA CAACCATGCA AAGCTCTAGG GAAGTGGGAA 5160 

GTTGAAAATG CCTAGGATCA GCTCTTTGGC TACCTGTGGT CACTCCTTTT ATTGTCGTCT 5220 

GCCCAGGTGG CAGCTACAAG AAGATTGGCT ACTATGACAG CACCAAGGAT GATCTTTCCT 5280 

GGTCCAAAAC AGATAAATGG ATTGGTGAGT GGATCTTGTT TGTATTTTCC TTCAGCCCCT 5340 

CTCGACAGTC AAGGGAAAAA GTCATGCCTT TGAGTGAGGA TGGAATGGTA GAGACTGTTA 5400 

GGTTGGAATG TGGCTGGCAG CTGGGCCAGG AGAAAGGGTT AAGTGAGAGT GAATACAACC 5460 

CCTAAGGCGT GGGTAGGGGA GACTGGTGTA TTTGGAGAGG GAATAGGCGG TGGTTAGTAC 5520 

TATTTTTAAT GGTGCATTGC TGGGGTAACT GGGGATTAGA GGCAGGGGGT GGGCAGAGGG 5580 

CGGGAAATGG AAACTCCATT TGGGTTTCCC AGATGTCCTG GTGTCTTGAT ATATTTGAAC 5640 

CAGCTACTTC AAGCCCAGAG CTGTCTCTTT GTCTGTCTCT GTCAGGAAAA CGGTTGCTTA 5700 

AACTATGGAG GAGGAGGGAA AACCTCATGT AATTGTCATC TGCCAAAATG TGCTTTTTAT 5760 

TTTTATATGT ATTTTTAAAA ATTTTCCTAT TTTTATGTAA TTTAGAGGTA GACGTGCAGT 5 820 

40 TGTGTTACAT GAATATATTG CATAGTGGTG AAGTCCGGGC GTTTAGTGTG CCTGTCACCC 5880 

GAACAGTGCA CCTTGTACCT AATAGGTAGT ATTACATCCC TCAAAATATA CTTTTTAAAG 5940 



25 



30 



35 



H1865-1 WO SEQ 



110 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 PCT/SE98/01947 

AGAGAAAGCA AGCAGTTATT CTTTGTGTAC TTGGTCTAAA TGATAGGACA TAGGAGAGAA 60*00 
ACTGAAGGTG GACAAAAGGA AGGACCTACT GATAAAAGAA AGCCTCCTTG AGAATGAAGG 6060 



5 GGAGGCTCAA CCATTGAAGA TGGCTGCCGT CTGCCCTGCC CAGCAGATAT CCAGTCATTC 
CCAGCACTGC TGGAGTTTTG CCCTTTTTTT TTTTTTTTTA CAATTCGAAT TTAGGACAAT 



10 



GTTCTGGATT GCTATAAATG CTGCATGGCC TAAATTATTC TTTAAAAAAA AAACTAAGCA 
AATTGAAATT AGTTTTTTTT GGTGAACTCT GACAAATTTG AACTTCCCCC TAATAATAAC 
TGGAAAACAT ATTTGGGAAT ATTACCCTGC CAGGATTAAA ATTTCAGATT AGCTTTCCTT 



15 CTTTTTTTGT TTGTCTTAAG AATAGGTGTC CACACTAGAT ACTTCAAGGC CTTTTTAGCT 



6120 
6180 
6240 
6300 
6360 
6420 



TTATGATTCC ATAATTGTCA TTTAAAACTT TGATTTGGGT TATAAGAAAC CTTATAACAT 6480 



20 



TTTTTAATGA TCCCCTTCTT TCTCCTCCCA TTTTCCTTTG CTGTAAGAAA GACAGAAAAA 6540 
CTTAAAGAAC AAACAAAAAC AAAGACTACA ACTTTGGGGA CATGCCTCAG CATTTCCCAA 6600 
CCTATGGATA GACCATTCAC TCCATCTTCT CATCTCATTT CTGGTTGCTT CCTAACGGCC 6660 



25 CCAGTGGCAC TGAGCATTCT GCCTGCAGTA ACCTCTGTCC AGTGCAGTTA GGGCCTCATG 6720 
TCCCCAGCCA ATGACTGAAT GTCCATCAGC AATCTAGTTC TTTGCCCTTT TCTCCTATCC 6780 
CGTCTTCATT CCTTTGTCCT CCTTCCCTTC TCTTTTCCCT TCCCCTCTTC CTCCCCTGTG 6840 



30 



CCATGCAGGA GGGTCCCCCC CAGCTGACCA GACCCTGGTC ATCAAGACAT TCCGCTTCCT 6900 
GTCACAGAAA CTCTTTATCT CCGTCTCAGT TCTCTCCAGC CTGGGCATTG TCCTAGCTGT 6960 
35 TGTCTGTCTG TCCTTTAACA TCTACAACTC ACATGTCCGG TAAGTTTCTC TTCTGACGTT 7020 



40 



TTCCTTGTCT GCCTCTCTGA GATACTGATC ATGTTCCTGG ACAGGATGAG AATAAAACCT 
GTGTAACTCC CATGGCCATG TATCATGGAG TTTTTCATTC TGACTTGTTG AGAATGAAAA 
CAGGGAAACC AGATATAACC CCCACTCCTA CTCCAAAGTA GCTAACGGGA GGAAAAAAGA 



7080 
7140 
7200 



AAAGAAGAGA AAAAAACAAC CTTTGGGGCC AGGTCTCACA GTCTTGGACT CTACATAAAT 7260 
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AGCCTGTATT CTAGTGGGGG CCTGTGCTTG GGAAGCCCTC TGCAAC TCC A TCTTCAGCCC 7320 

CATGACTGCA TTGCTCTGCC TCTCAAGGCT CCACTGTCTT CTCCAATCCT GTCTTCCTTT 7380 

AGCCCCTGGC CCTGAAATTA GGGTCATGCC ATTGCGTGGT ATTTGGAGAG CTCAGCCTCC 7440 

CTGGAGAAGA GGGGTAATTC TCTCTCCCTC TCACCCTCTC CACCTCTGCC CTAGTTATAT 7500 

C C AGAACTC A CAGCCCAACC TGAACAACCT GACTGCTGTG GGCTGCTCAC TGGCTTTAGC 7560 

TGCTGTCTTC CCCCTGGGGC TCGATGGTTA CCACATTGGG AGGAACCAGT TTCCTTTCGT 7620 

CTGCCAGGTG AGGAGGTGGT GGGCAAATTC CTTACAGGAT GTGACTCTCC CACCCGTCTC 7680 

AGGAGCACCT TCCATGATTT ATGATTCTCT GCCCTTCCTC CTCAGCTTTC CCTGACTCTT 7740 

GTCCCTGTTC TTTCCTTCTA GCATCACCCC TCTGTTCTCT GTTTGGCTCT GTCCCTTCTT 7800 

TCTGTGTCTG CAGGCCATTT TCATTCTGTA GTTTACTTGT CAGTTCCAAG GTTGCCATGG 7860 

CAGSCCTYGC AGAGAAGAGG AGGGAGCCAT TGAAGGCAAA GGAAGGGGAT CTGCTCAAAG 7920 

GTCTCCTGAA CAATGGTGGC TTGTCTGTGG TATGGGGGCT GAGAATCAGA ACTGTGGACT 7980 

TTTTTTGGGA GCCTTTGTTG GGTTTGGAAG GATAGAAGCA GAGATGGAAA CACAGCAGAG 8040 

AGTTGGGGGG AAGGGACCAC TGCCACACAG GGGAGGAGGG GCTCTGGGAC TGTTGGTACA 8100 

TGGAAGGTTC TAGTGCTGTG GGGAGAGGCC AGCTTCAACA GTGATAGTTG AGTGGTTCTC 8160 

TTTTCCACTG GTGGAAACAC CCACTCTTTC TCCTGATCTG CCTGCCTGTC CTTGCTCTCT 8220 

CTTTTTCCTC TGCTCTGTGC TGTCCTGATC ATACATCTGT GCACATGGCA TTTCCATGCA 8280 

CATGCACATG CAGTTCATCA GGAATCCTCT GTTCCCAGTG AGGCCAGAGT GCAGCTGGAG 83 40 

AAGCAGACAA TTAGCTGTAG TGCAATAGGA GAGGTTCCAG AGTAGGGATC TGCACAAAGT 8400 

GCTTTGGGGG CAAAGAAGGG AACACAGTTC ACTGCTGGCG TGATTGGGTG GACCTCACTG 8460 



AAGAGGTGGC ATTTGAATAC TGAAGGACAA ATAGGATTTT ATCAGCTAGA GAAATAGAGG 8520 
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AAGGCTACTT CAGGGGCATA GGGAGCATCG TGTGGCTAGA AAATACATGA AAGAGAGTAG 85B0 

ATGAAGAGAA AGTGAGTAGT TCAGCATGGC TGGAG CGTGG GGTAGGTGTG GGGCTGGGAG 8640 

ATGAGCCTAG CTGGACAGGT GGATGGGAGC ATGTTGTGAA GGGTCTGTGT CATATCCAGA 8700 

AGTGTTCAGG CTATAACTTA TAGATATTGG GGAGTGGTTG GAGGTTTTTG GCCACTAAAG 8760 

CCAGGAGGTT TTAGCAAGAT CACCCTGGTG GTGTGGAAGT AGAGGGTGGA TGGGAGGAAT 8820 

TGTTCAAGGT GGGGAGACTG CTCTCCTCCT GCCGCTCCCC GTCC TGCTC A CATTTTCGCA 8880 

TCCTCCCTGT GCCACCATGA GCTCCCTGCC CGTGCTCCCT GCCCACTCTC CCTTAGGGTT 8940 

CTGCCCATCC TTACTGCAGT CCCGGCTACT ACTCTACCCT GTTCTGCCTG TGCCCTCTCT 9000 

TCCTTTCTAG GCCCGCCTCT GGCTCCTGGG CCTGGGCTTT AGTCTGGGCT ACGGTTCCAT 9060 

GTTCACCAAG ATTTGGTGGG TCCACACGGT CTTCACAAAG AAGGAAGAAA AGAAGGAGTG 9120 

GAGGAAGGTG AGCTGCTGCC CAATCCTCAG CCCCCARATC CTTGGCTCCT GGGGCACAGA 9180 

GCATTTTCCC CTGACGTGCC TGTTCTCCCC ACATATTTAT CCAGACTCTG GAACCCTGGA 9240 

AGCTGTATGC CACAGTGGGC CTGCTGGTGG GCATGGATGT CCTCACTCTC GCCATCTGGC 9300 

AGATCGTGGA CCCTCTGCAC CGGACCATTG AGGTACCACT GGAGAGGAGG TGCTATGGTC 93 60 

AGGAGAATGA GCAGGGCTCA GTGGCCATCA GGGCCCTGGG GCTGTGTGTG TCTTGAGGGA 9420 

TGAAGCTACT TGGAGAGAGT GCCTTCCTCG TATTGGAAGC TCTTCCTTTC CTTCCTAGAA 9480 

GGAGCCCCTC ATAGGCCTCC AGATTCAGCT GAAGAAAGGA AGGGGTGGGA ATCTGGGAAG 9540 

GGTGTGTAGA ACTTCCAGGC ATCAGGGAAA GTGGGGAACA AGCACCTCCA AGGGTTCAGG 9600 

AAAACATTCT TAGGCCTAGA ATGAGATTTG GCATCAGCAT TGAGGGTCTC ATAGGAAAAC 9660 

AGTTGGAAGC CAGAGACTGA GAAGCGTTGA GGAGAGGAGG GGAGGCTGGC AACCATCTTT 9720 

CTTGTGACCT TGTTTCTGCC CTAGACATTT GCCAAGGAGG AACCTAAGGA AGATATTGAC 9780 

GTCTCTATTC TGCCCCAGCT GGAGCATTGC AGCTCCAGGA AGATGAATAC ATGGCTTGGT 9840 
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GTGTGGGATG TGGGCAAAGG AGGGCAGGGA 
GAGGGCTTGG . AGGGAGAGGG GTCTTTGGAA 

5 

GAACACCCTA CTCTTTGTTA TTGCACTAAT 
TTGCTATGAG GAGCTTAGGG CCCAAAGCAC 
10 CAGAGGAACA AGTGGGGGTA ACCCCACCTC 
TGGCCTTATT AGCATTGTTC GATTCATGGT 
AAGCCACCTT CCCACCTGTA GGGTAAGGGT 

15 

CTAGATATTC AACAAGCCCT TGCCTGACTG 
TGAATCCCAG GAAAAAGTGA TGTGGTCTGG 
20 CACAGGGTGA GGTTTGCTAG GTCAGAGGGG 
TTCAGAGTAG GTGAAGGGCA GAGAGGGGAT 
GTTGCTCAAG TGACTCTCTC CTGCCATCCT 

25 

TGCTGCTGCT GGGAATCTTC CTTGCTTATG 
ATGATCACCG GGCTGTGGGC ATGGCTATCT 
30 CATTGACCCT GTAGGCTGAC CACAGCAGCC 
TAGATCTGGG ATCGGTTGCT TAGAAGTCTT 
AGCACTTTAC AGTTTACAAA GCTCACTACA 

35 

AGTGAAGTAG ATATTAGTAT CCCCACTTTA 
AAATTG CTTG TCTGTGGTTA ATGGGCTGGA 
40 TCTGGAGGAC CCGGAATCTG TGCATAGAGA 
TTAACCAAGA GTGAAAACTG GTTTGGGACA 



PCT/SE98/01947 

TGCACAAAGG CAGGAGGGAA GGCAGGGGTA 9900 

GAGGAGGTAG AGAGCTTGTC AACCCAGTTT 9960 

CTTTTCTGAG AATAGGGGAG AGTTGCTCTT 10020 

AGAAAGCACA GATGAAGAAC TTGTGTTCAG 10080 
CAGACTTGAC ATT ATCTT TT AGATCCCCCT ' 10140 

CACAAATTGC AAACCTACCC TCTGCCTGGA 10200 

GAGGCATGTG TGGCCCAGAC TGGCCTATTT 10260 

ACAGCAGCTT GCCACCATTG CTTTCCTGTG 10320 

GCAAGTTGGG TGGACATAAG GGATAGGGGA 10380 

TTGGATTGGA GAGGAGGGCC CCCTTTCCAT 10440 

GGGGATTGAG TGAGGAGCAT TGTGGTCCTT 10500 

AGGCATTTTC TATGGTTAAC AAGGGGCTGC 10560 

AGACCAAGAG TGTGTCCACT GAGAAGATCA 10620 

ACAATGTGGC AGTGAGCACT GACCCCATGG 10680 

CAGATATAGA GGACTAGGAA GAATCAATGC 10740 

AAAAAGTTTG TTAATTCTTC AGGTCTATAA 10800 

GACATTGTAT CATTAATCTT GCAACTACCC 10860 

TAGGTGAGGA AACAGAAACA CAGAGACGTT 10920 

CTCTATTGAC ATTTCCTGCC AGGGACCGAC 10980 

TCCTGGGAGT TCCTGCCTTG AGGGGAGGGG 11040 

GTTTGAGATT TTTCTCAATC TATATTGAGG 11100 
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ATGATCCTGA ATTTGGATCC TTTTCAAAGG GAAAGTTCAC CAGGAAACTG TCTGCATAGA 111*00 

CTCCCTCCCA TGGGAAGTAA ACTCTGGATC TTGTCTGAGC CTGCAGACCT GAGACTCCCT 11220 

5 CAATGTGTCT TTCCCTCTAG GTCCTGTGCC TCATCACTGC TCCTGTCACC ATGATTCTGT 11280 

CCAGCCAGCA GGATGCAGCC TTTGCCTTTG CCTCTCTTGC CATAGTTTTC TCCTCCTATA 11340 

TCACTCTTGT TGTGCTCTTT GTGCCCAAGG TAAGGATCTG GCTTTTCTCC CACCCTCTTT 11400 

GTTC CCATGT TCCCTCCATC CCTCCTTCCT ATATTACTGA GTTCCTCTGC CCTTCCGTTC 11460 

ACCCTCCTCT CACTCCTCCC CTTGTTTTGG GCCCAACTCT TATCAGCATT CCTTCCACCT 11520 

15 CCAACCTTCC ATCAGCCAGT CACTAGTACA GTCCTTGCTG GGCCACCCCA CGCCCAAACA 11580 

TTTGCCCCCA GATGCGCAGG CTGATCACCC GAGGGGAATG GCAGTCGGAG GCGCAGGACA 11640 

CCATGAAGAC AGGGTCATCG ACCAACAACA ACGAGGAGGA GAAGTCCCGG CTGTTGGAGA 11700 

20 

AGGAGAACCG TGAACTGGAA AAGATCATTG CTGAGGTGCG GGGGTGGGTG TCAGGGTAGG 11760 

GTGTTGGAGT GGTCCAGGAG GCTTGCGTCT TAGCTTGGGT TGTC TGAAGC CAAGCC TGAG . 11820 

25 ATACAGGGTC AGATGTTCTT GGCTCATGGA GGGAGGGTCC TAGGAGACAA CCTGTAAGGA 11880 

GTGAATGGAG CAGCATAGGG GAGGGGAAAG GGCTGAGCAA GATTCTATCT CAGGCAAAAT 11940 

CCAGTGTTGG CCTGGCAGGT GGAAGGGCTC TGGAGTGGGA GCTATGTGGT TGACTCAGCC 12000 

30 

TCCTTAAGGC AAGAGGATGG CTGTTGGCTG TAGGTGACAA CTGGAGAGAG GCAGCTGTGA 12060 

GCCTCTAGTA GTCAACACTC ACAGCAGCTG GGTGTAGCAT GCAGCCCCAG CATAAAGGAC 12120 

35 CTGGGCAGGC GTTC AC TGTG CCCCAGGCTG TCATTAGGGG CTGGTGCAAT GCCAAAGAGA 12180 

GGGATGTTCC AACTGGGTTG ACACATCTCT CTGATTTATT GGAAGCTCTG TGCACTGACT ' 12240 

TTTCTCTCCT TCCCCACTTT TTCCTTTTGT TTTTAAATTC TCTCTTATTT CCCTGATCGC 12300 

ATTTTTTCTA TCGGTATCCT TATGTTCTCT GGCTTTTCTT GTTCTGTTTT GATTTCTCCT 12360 

TTTAATTTAT TCTGTCCACT TACCCTACGT CCTCCCCCTA CATTTTTCTG TGCCCTTCCT 12420 
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CTCTTTCCCT GTGCCCTTCC TCTCTTTCCC TCCTCCCCAC TCCTTCATCA CCTCCTCTTC 12480 

TCCTACTATC CCAATTGTGC TTCTTCCTCC AGAAAGAGGA GCGTGTCTCT GAACTGCGCC 12540 

5 

ATCAAC TCC A GTCTCGGCAG CAGCTCCGCT CCCGGCGCCA CCCACCGACA CCCCCAGAAC 12600 

CCTCTGGGGG CCTGCCCAGG GGACCCCCTG AGCCCCCCGA CCGGCTTAGC TGTGATGGGA 12660 

10 GTCGAGTGCA TTTGCTTTAT AAGTGAGGGT AGGGTGAGGG AGGACAGGCC AGTAGGGGGA 12720 

GGGAAAGGGA GAGGGGAAGG GCAGGGGACT CAGGAAGCAG GGGGTCCCCA TCCCCAGCTG 12780 

GGAAGAACAT GCTATCCAAT CTCATCTCTT GTAAATACAT GTCCCCCTGT GAGTTCTGGG 12840 

15 

CTGATTTGGG TCTCTCATAC CTCTGGGAAA CAGACCTTTT TCTCTCTTAC TGCTTCATGT 12900 

AATTTTGTAT CACCTCTTCA CAATTTAGTT CGTACCTGGC TTGAAGCTGC TCACTGCTCA 12960 

20 CACGCTGCCT CCTCAGCAGC CTCACTGCAT CTTTCTCTTC CCATGCAACA CCCTCTTCTA 13 020 

GTTACCACGG CAACCCCTGC AGCTCCTCTG CCTTTGTGCT CTGTTCCTGT CCAGCAGGGG 13080 

TCTCCCAACA AGTGCTCTTT CCACCCCAAA GGGGCCTCTC CTTTTCTCCA CTGTCATAAT 13140 

25 

CTCTTTCCAT CTTACTTGCC CTTCTATACT TTCTCACATG TGGCTCCCCC TGAATTTTGC 13200 

TTCCTTTGGG AGCTCATTCT TTTCGCCAAG GCTCACATGC TCCTTGCCTC TGCTCTGTGC 13260 

30 ACTCACGCTC AGCACACATG CATCCTCCCC TCTCCTGCGT GTGCCCACTG AACATGCTCA 13320 

TGTGTACACA CGCTTTTCCC GTATGCTTTC TTCATGTTCA GTCACATGTG CTCTCGGGTG 13 3 80 

CCCTGCATTC ACAGCTACGT GTGCCCCTCT CATGGTCATG GGTCTGCCCT TGAGCGTGTT 13440 

35 

TGGGTAGGCA TGTGCAATTT GTCTAGCATG CTGAGTCATG TCTTTCCTAT TTGCACACGT 13500 

CCATGTTTAT CCATGTACTT TCCCTGTGTA CCCTCCATGT ACCTTGTGTA CTTTCTTCCC 13560 

40 TTAAATCATG GTATTCTTCT GACAGAGCCA TATGTACCCT ACCCTGCACA TTGTTATGCA 13620:. 

CTTTTCCCCA ATTCATGTTT GGTGGGGCCA TCCACACCCT CTCCTTGTCA CAGAATCTCC 13 680 
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ATTTCTGCTC AGATTCCCCC CATCTCCATT GCATTCATGT ACTACCCTCA GTCTACACTC 13740 

ACAATCATCT TCTCCCAAGA CTGCTCCCTT TTGTTTTGTG TTTTTTTGAG GGGAATTAAG 13800 

5 GAAAAATAAG TGGGGGCAGG TTTGGAGAGC TGCTTCCAGT GGATAGTTGA TGAGAATCCT 13860 

GACCAAAGGA AGGCACCCTT GACTGTTGGG ATAGACAGAT GGACCTATGG GGTGGGAGGT 13920 

GGTGTCCCTT TCACACTGTG GTGTCTCTTG GGGAAGGATC TCCCCGAATC TCAATAAACC ■ 13980 

AGTGAACAGT GTGACTCGGC ACCTTGCAGT CTTCCTGTGA ACAGAATGGG CTTCAATCCA 14040 

AGAAGGGAGG CTCAGAGGAC TCCAAGTTCA TGAAAAGGCA TTAAAGCGGA GGGTGAAAAG 14100 

15 AGGTGTTTTA TTGATCCATT GAGGGCTTAG CAGAATGAAG CAGGACATGA TTAAGTCTGA 14160 

GATTAGTGAG TGAGGACACT ACTGGTTAAA AGTGTGGGCT CTGGAGTCAG ACTGCCAGGG 14220 

TATCAGATCC AACCACATGC AAACATTTTC TTAGTCTCTA TTCCCCATGT CCTCATTTAT 14280 

20 

GAAAATGAGA ATAACAGTAA TACATTCCTC CATAGGTTGG GTACAAAGAC TATTATAAAT 14340 

TGTGCATTCA GGTGCCTAGG TTGGCCCTTG GGCCATGGTA TATGTTGCGT GAATGTTAGC 14400 

25 CTCTGTCCCT GCTGTTTAAT G AGTTC CTTG ACAGTAGTGG GCATGTATTG GGAGCCTGGA 14460 

GCAAGTGCCT AAGCATCCCC TCTAGGGACG CTCCTTCCCA GGAACTAAGA AGAGTAAAAG 14520 

AATGATGACT GCTAGAAGGT AATGGATGAG ATGGCTGCTG AGTGCTTCCA ACCTTAAACA 14580 

30 

TCTTTGTTTA GGAACTCTGA GCATCTTGGA AATAATTTGC TATCAAACTG AAAAAAAATC 14640 

TTGAATGGAC AAGGGCAAAA ACATTTGCCT GAGACTTTAA ACATTTTTTG TGTCATCTTG 14700 

35 GAGAGTGCTT TTTTGAAGCT CAAATTTTCT TTTATTTTGG CACTGATTTT TAAAGTGATT 14760 

CTCAGATTTT TGTAAGAC AG CTGCAAGGGT TGGATGGGCC CTGTCATTCA CTGACCTGTT 14820 

AAGAGCCAAT TTCTGAACTT C C ACT AAAGC ATGGGCTGGT TGAATCTTAG AC C AGTACTT 14880 

40 

G AAAAACTTT - CCACTGTGAT TATCCACCTT GGACCAGTTG GACTTAATTT GAGTCTCTCT 14940 

TCTTCCCACA GTGAAATATC CGCTAGGAAA AGAGAAGAAG TCTATGGAGT AGATAAGCCT 15000 
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GCAGTTTGGA AAATGAATAG TTGGCATCAG AACAGCAGCA GGAGGCTGGG TGTGGTGGCT X5060 

CACGCCTGTA ATCCCAGCAC TTTGGGAGGC TGAGGTGGGT GGATCACGAG GTCAGGAGTT 15120 

CAAGACCAGC CTGGCCAAGA TGGCGAAACC CTGTCTCTAC TAAAAATACA AAAATTAGCT 15180 

GGGCGTGATG GTGGGTGCCT GTAATCTCAG CTACTCGGGA GGCTGAGGCA GATAATTGCT 15240 

TGAACCCAGG AGATGAAGGT TGCAGTGAGT TTGTGCCACT GCACTCCAGC CTGGGCGACA 153 00 

GAGTGAGACT GTCTCAAAAA AAAAAAAAAA GAAAAAAAAA AAAGAAACAA CAGTAGCAGG 15360 

AGCTATAGAA CAGCCCTGGG TAGAACCTAA AAGACCCAAA TTATCATCTC AAACTTGCAT 15420 

TGCACTTAAG TGGGCTGTAA ATTATAAACA AAGGGTGAAA AGTTCTACTG TGGCCAAAGG 15480 

TAAGCCAGAC ACTCTGCTAG CAGGAGTGCA GGAGTCGAGA GCCAAACGGT GCGGCTAGCA 15540 

GAGTGCCCAG TGCAAAGGGG TGGGAAGGAG TGAGATTGAG AATATTAAAA AGGTACTTAG 15600 

AAGAGAACTT GTAAGATTTT TACTGGCCAA ATTTAAAACA TGAC TGAGC A CTATTTTTCA 15660 

TACAGGCCTC CTACTAATAA GAAAACAATT TTGAGATAAC TACTTATTTG AGTTCACAGT 15720 

TAATGTTCCT GATGATTAAG ATCAGTTGCA AATGTTCATC TGTCAATGCT TATCTACAAT 15780 

GAGACTTCAT GTATTCATTT CTGAAAGTGT CTTTTCAGGG TGAATGGTGC TATTGATTAG 15840 

30 CACTAATACT AATTATTAGT ACATTATATA TAATTACAAT GAGATATACA CACACATACA 15900 

CACACACATA TATACATATA CACACACACA ATGATATATA ATTAATATAT AATTGTACCC 15960 

CAAGGGGTGC AAAGGAGATG TGTTGCCAGG TGGAGAGGCT ACCGCTTGGC AGTTCTGGGA 16020 

35 

GGACTTGCTC CCTGTGCACT GTGAGGCAGG CTTTGCCTTT CAGACCTGCC TTTGGGTAGG 16080 

GTTCAGATCA CTTTCTAACT CTGGAATGTC CTAGAATGTA GACTGCCTGC AGGCTTCCAT 16140 

40 GTCCCCTGCT TTTCCCTGAC TTAGCCTGTT GCCTCCCTGC CCTCCTGTTG GTTGTCTACC 16200 t , 

AGTAGAGAGC ACTTTGTGTG CACTTGGCTG CTACATTAGT TAGGTGATCT TCAACAAGTG 16260 



25 
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TTGATGGTGA GTTGCTGTGG CAGGTGCTTT TTTTGGCACT GAGGCAAAAT GGTGAGTAAG 16320 

ATGGCTTTCA AGCGTTGTAC CTTCTCGACG TGGGAAGATG ACCAGTAAGC AGAAAAACAA 16380 

ACGAGATCAC TTAAGAGAGC AACCAGGAGT GTTGGGCATC TCACAGCCAT TAGCTCTGGT 16440 

GTGAAGGACA AATCTAAAAG CAAGGGGACT GTGTGTTCAT TTTCTGGGGT CACAAAACTA 16500 

AGGAGCAAAG CCAGTATTCA AACTGCATAT ATTATTATCT ATTGCCACAA AACCTGTTAC 16560 

CCTAAATGGC TTCAGACAAC AATAATCATT TATTATCCAT CATGGTTTTT GTGGGCTAGA 16620 

AATTCAGATA GGGTACAGTG GGGAGGGTTC ACCCTGCTCC ACAGCATCTG GAAGATCTGA 16680 

AGTCTGAGGG TTGTTGTCCA AAAGATC 16707 



(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 50 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62: 
ATGCGCGCCG GCAGCCAACA TGCTGCTGCT GCTGCTGGTG CCTCTCTTCC 
(2) INFORMATION FOR SEQ ID NO: 63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63: 
GGTC AT CC AG CGTTGAGGTG AAGAC 25 

(2) INFORMATION FOR SEQ ID NO : 64: 

(i) SEQUENCE CHARACTERISTICS: 
10 (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

15 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
GAAGGTTGCC AGATTATACA TCCGC 25 

20 

(2) INFORMATION FOR SEQ ID NO: 65: 

(i) SEQUENCE CHARACTERISTICS: 
25 (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
CCACGATGAT TCGAGCATCT TGACG 25 

35 

(2) INFORMATION FOR SEQ ID NO : 66: 

( i ) SEQUENCE CHARACTERISTICS : 
40 (A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66: 

5 

CTGGTTCCTC CCAATGTG 



(2) INFORMATION FOR SEQ ID NO: 67: 

10 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

( C ) STRANDEDNES S : s ingle 
15 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67: 

20 

CCAGTGGACT ATGAGATTGA G 21 



(2) INFORMATION FOR SEQ ID NO: 68: 

25 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 
30 ( D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68: 



35 



CTGGTTCCTC CCAATGTG 18 



(2) INFORMATION FOR SEQ ID NO: 69: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 
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20 



(C) STRANDEDNESS : single 
<D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69: 
CCAGTGGACT ATGAGATTGA G 21 

(2) INFORMATION FOR SEQ ID NO : 70: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2700 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: cDNA to mRNA 

( iii ) HYPOTHETICAL : NO 

(iv) ANTI-SENSE: NO 

25 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/KEY: CDS 
30 (B) LOCATION: 1 2700 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 70: 

ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
35 Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
1 5 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 9 6 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
40 20 25 30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
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35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 

Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
5 50 55 60 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

10 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 28 8 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

15 GTG AAT CGA ACG CCA CAC TCA GAA CGG CGC GCA GTG TAC ATC GGG GCA 33 6 

Val Asn Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly Ala 
100 105 110 

CTG TTT CCC ATG AGC GGG GGC TGG CCA GGG GGC CAG GCC TGC CAG CCC 3 84 

20 Leu Phe Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro 
115 120 125 

GCG GTG GAG ATG GCG CTG GAG GAC GTG AAT AGC CGC AGG GAC ATC CTG 432 

Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie Leu 
25 130 135 140 

CCG GAC TAT GAG CTC AAG CTC ATC CAC CAC GAC AGC AAG TGT GAT CCA 480 

Pro Asp Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp Pro 

145 150 155 160 

30 

GGC CAA GCC ACC AAG TAC CTA TAT GAG CTG CTC TAC AAC GAC CCT ATC 528 

Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie 
165 170 175 

35 AAG ATC ATC CTT ATG CCT GGC TGC AGC TCT GTC TCC ACG CTG GTG GCT 57 6 

Lys lie lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala 
180 185 190 

GAG GCT GCT AGG ATG TGG AAC CTC ATT GTG CTT TCC TAT GGC TCC AGC 624 

40 Glu Ala Ala Arg Met Trp Asn Leu lie Val Leu Ser Tyr Gly Ser Ser - 
195 200 205 

TCA CCA GCC CTG TCA AAC CGG CAG CGT TTC CCC ACT TTC TTC CGA ACG 672 
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Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr 
210 215 220 

CAC CCA TCA GCC ACA CTC CAC AAC CCT ACC CGC GTG AAA CTC TTT GAA 720 
5 His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu 
225 230 235 240 

AAG TGG GGC TGG AAG AAG ATT GCT ACC ATC CAG CAG ACC ACT GAG GTC 768 
Lys Trp Gly Trp Lys Lys lie Ala Thr lie Gin Gin Thr Thr Glu Val 
10 245 250 255 

TTC ACT TCG ACT CTG GAC GAC CTG GAG GAA CGA GTG AAG GAG GCT GGA 816 

Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly 

260 265 270 

15 

ATT GAG ATT ACT TTC CGC CAG AGT TTC TTC TCA GAT CCA GCT GTG CCC 864 

lie Glu lie Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro 

275 280 285 

20 GTC AAA AAC CTG AAG CGC CAG GAT GCC CGA ATC ATC GTG GGA CTT TTC 912 
Val Lys Asn Leu Lys Arg Gin Asp Ala Arg lie lie Val Gly Leu Phe 
290 295 300 

TAT GAG ACT GAA GCC CGG AAA GTT TTT TGT GAG GTG TAC AAG GAG CGT 960 
25 Tyr Glu Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg 
305 310 315 320 

CTC TTT GGG AAG AAG TAC GTC TGG TTC CTC ATT GGG TGG TAT GCT GAC 1008 
Leu Phe Gly Lys Lys Tyr Val Trp Phe Leu lie Gly Trp Tyr Ala Asp 
30 325 330 335 

AAT TGG TTC AAG ATC TAC GAC CCT TCT ATC AAC TGC ACA GTG GAT GAG 1056 

Asn Trp Phe Lys lie Tyr Asp Pro Ser lie Asn Cys Thr Val Asp Glu 
340 345 350 

35 

ATG ACT GAG GCG GTG GAG GGC CAC ATC ACA ACT GAG ATT GTC ATG CTG 1104 

Met Thr Glu Ala Val Glu Gly His lie Thr Thr Glu lie Val Met Leu 

355 360 365 

40 AAT CCT GCC AAT ACC CGC AGC ATT TCC AAC ATG ACA TCC CAG GAA TTT 1152. 
Asn Pro Ala Asn Thr Arg Ser lie Ser Asn Met Thr Ser Gin Glu Phe 
370 375 380 
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GTG GAG AAA CTA ACC AAG CGA CTG AAA AGA CAC CCT GAG GAG ACA GGA 
Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly 
385 390 395 400 



12~0~0 



GGC TTC CAG GAG GCA CCG CTG GCC TAT GAT GCC ATC TGG GCC TTG GCA 
Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala lie Trp Ala Leu Ala 
405 410 415 



1248 



CTG GCC CTG AAC AAG ACA TCT GGA GGA GGC GGC CGT TCT GGT GTG CGC 
10 Leu Ala Leu Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg 
420 425 430 



1296 



15 



CTG GAG GAC TTC AAC TAC AAC AAC CAG ACC ATT ACC GAC CAA ATC TAC 
Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie Tyr 
435 440 445 



1344 



20 



CGG GCA ATG AAC TCT TCG TCC TTT GAG GGT GTC TCT GGC CAT GTG GTG 13 92 

Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val 
450 455 460 

TTT GAT GCC AGC GGC TCT CGG ATG GCA TGG ACG CTT ATC GAG CAG CTT 1440 

Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin Leu 

465 470 475 480 



25 CAG GGT GGC AGC TAC AAG AAG ATT GGC TAC TAT GAC AGC ACC AAG GAT 
Gin Gly Gly Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp 
485 490 495 



1488 



GAT CTT TCC TGG TCC AAA ACA GAT AAA TGG ATT GGA GGG TCC CCC CCA 
30 Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp lie Gly Gly Ser Pro Pro 
500 505 510 



1536 



35 



GCT GAC CAG ACC CTG GTC ATC AAG ACA TTC CGC TTC CTG TCA CAG AAA 
Ala Asp Gin Thr Leu Val lie Lys Thr Phe Arg Phe Leu Ser Gin Lys 
515 520 525 



1584 



40 



CTC TTT ATC TCC GTC TCA GTT CTC TCC AGC CTG GGC ATT GTC CTA GCT 1632 
Leu Phe lie Ser Val Ser Val Leu Ser Ser Leu Gly lie Val Leu Ala 
530 535 540 

GTT GTC TGT CTG TCC TTT AAC ATC TAC AAC TCA CAT GTC CGT TAT ATC 1680 
Val Val Cys Leu Ser Phe Asn lie Tyr Asn Ser His Val Arg Tyr lie 
545 550 555 560 
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CAG AAC TCA CAG CCC AAC CTG AAC AAC CTG ACT GCT GTG GGC TGC TCA 1728 

Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser 
565 570 575 

5 

CTG GCT TTA GCT GCT GTC TTC CCC CTG GGG CTC GAT GGT TAG CAC ATT 1776 

Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His lie 
580 585 590 

10 GGG AGG AAC CAG TTT CCT TTC GTC TGC CAG GCC CGC CTC TGG CTC CTG 1824 
Gly Arg Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu 
595 600 605 

GGC CTG GGC TTT AGT CTG GGC TAC GGT TCC ATG TTC ACC AAG ATT TGG 1872 
15 Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys lie Trp 
610 615 620 

TGG GTC CAC ACG GTC TTC ACA AAG AAG GAA GAA AAG AAG GAG TGG AGG 192 0 

Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg 
20 625 630 635 640 

AAG ACT CTG GAA CCC TGG AAG CTG TAT GCC ACA GTG GGC CTG CTG GTG 1968 
Lys Thr Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val 
645 650 655 

25 

GGC ATG GAT GTC CTC ACT CTC GCC ATC TGG CAG ATC GTG GAC CCT CTG 2 016 

Gly Met Asp Val Leu Thr Leu Ala lie Trp Gin lie Val Asp Pro Leu 
660 665 670 

30 CAC CGG ACC ATT GAG ACA TTT GCC AAG GAG GAA CCT AAG GAA GAT ATT 2064 
His Arg Thr He Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp He 
675 680 685 

GAC GTC TCT ATT CTG CCC CAG CTG GAG CAT TGC AGC TCC AGG AAG ATG 2112 
35 Asp Val Ser He Leu Pro Gin Leu Glu His Cys Ser Ser Arg Lys Met 
690 695 700 

AAT ACA TGG CTT GGC ATT TTC TAT GGT TAC AAG GGG CTG CTG CTG CTG 2160 
Asn Thr Trp Leu Gly He Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu 
40 705 710 715 720 

CTG GGA ATC TTC CTT GCT TAT GAG ACC AAG AGT GTG TCC ACT *GAG AAG 2208 
Leu Gly He Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys 
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725 730 735 ~~ 



ATC AAT GAT CAC CGG GCT GTG GGC ATG GCT ATC TAC AAT GTG GCA GTC 2256 
lie Asn Asp His Arg Ala Val Gly Met Ala lie Tyr Asn Val Ala Val 
5 740 745 750 



CTG TGC CTC ATC ACT GCT CCT GTC ACC 

Leu Cys Leu lie Thr Ala Pro Val Thr 

755 760 

10 

GAT GCA GCC TTT GCC TTT GCC TCT CTT 

Asp Ala Ala Phe Ala Phe Ala Ser Leu 

770 775 

15 ATC ACT CTT GTT GTG CTC TTT GTG CCC 

lie Thr Leu Val Val Leu Phe Val Pro 
785 790 



ATG ATT CTG TCC AGC CAG CAG 23 04 

Met lie Leu Ser Ser Gin Gin 
765 

GCC ATA GTT TTC TCC TCC TAT 23 52 

Ala lie Val Phe Ser Ser Tyr 
780 

AAG ATG CGC AGG CTG ATC ACC 2400 
Lys Met Arg Arg Leu lie Thr 
795 800 



CGA GGG GAA TGG CAG TCG GAG GCG CAG GAC ACC ATG AAG ACA GGG TCA 2448 
20 Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly Ser 

805 810 815 



TCG ACC AAC AAC AAC GAG GAG GAG AAG TCC CGG CTG TTG GAG AAG GAG 2496 
Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu 
25 820 825 830 

AAC CGT GAA CTG GAA AAG ATC ATT GCT GAG AAA GAG GAG CGT GTC TCT 2544 
Asn Arg Glu Leu Glu Lys lie lie Ala Glu Lys Glu Glu Arg Val Ser 
835 840 845 

30 

GAA CTG CGC CAT CAA CTC CAG TCT CGG CAG CAG CTC CGC TCC CGG CGC 2592 
Glu Leu Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg 
850 855 860 

35 CAC CCA CCG ACA CCC CCA GAA CCC TCT GGG GGC CTG CCC AGG GGA CCC 2640 
His Pro Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro Arg Gly Pro 
865 870 875 880 

CCT GAG CCC CCC GAC CGG CTT AGC TGT GAT GGG AGT CGA GTG CAT TTG 2688 
40 Pro Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu 

885 890 895 

CTT TAT AAG TGA 2700 
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Leu Tyr Lys * 



5 (2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 899 amino acids 

(B) TYPE: amino acid 
10 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71: 

15 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 20 25 30 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

25 Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

30 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

Val Asn Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly Ala 
35 100 105 110 

Leu Phe Pro Met Ser Gly Gly Trp Pro Gly Gly Gin Ala Cys Gin Pro 
115 120 125 

40 Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg Asp lie Leu 
130 135 140 

Pro Asp Tyr Glu Leu Lys Leu lie His His Asp Ser Lys Cys Asp Pro 
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145 150 155 160 

Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie 
165 170 175 

5 

Lys lie . He Leu Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala 
180 185 190 

Glu Ala Ala Arg Met Trp Asn Leu He Val Leu Ser Tyr Gly Ser Ser 
10 195 200 205 

Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe Phe Arg Thr 
210 215 220 

15 His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu 
225 230 235 240 

Lys Trp Gly Trp Lys Lys He Ala Thr lie Gin Gin Thr Thr Glu Val 
245 250 255 

20 

Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly 
260 265 270 

He Glu He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro 
25 275 280 285 

Val Lys Asn Leu Lys Arg Gin Asp Ala Arg He He Val Gly Leu Phe 
290 295 300 

30 Tyr Glu Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg 
305 310 315 320 

Leu Phe Gly Lys Lys Tyr Val Trp Phe Leu He Gly Trp Tyr Ala Asp 
325 330 335 

35 

Asn Trp Phe Lys He Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu 
340 345 350 

Met Thr Glu Ala Val Glu Gly His He Thr Thr Glu He Val Met Leu 
40 355 360 365 

Asn Pro Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe 
370 375 380 
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Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly 
385 390 395 400 

5 Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala lie Trp Ala Leu Ala 

405 410 415 

Leu Ala Leu Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg 
420 425 430 

10 

Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr lie Thr Asp Gin lie Tyr 
435 440 445 

Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val 
15 450 455 460 

Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin Leu 
465 470 475 480 

20 Gin Gly Gly Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp 

485 490 495 

Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp lie Gly Gly Ser Pro Pro 
500 505 510 

25 

Ala Asp Gin Thr Leu Val lie Lys Thr Phe Arg Phe Leu Ser Gin Lys 
515 520 525 

Leu Phe lie Ser Val Ser Val Leu Ser Ser Leu Gly lie Val Leu Ala 
30 530 535 540 

Val Val Cys Leu Ser Phe Asn lie Tyr Asn Ser His Val Arg Tyr lie 
545 550 555 560 

35 Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser 

565 570 575 

Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His lie 
580 585 590 

40 

Gly Arg Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu 
595 600 605 
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Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys lie Trp 
610 615 620 

Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg 
5 625 630 635 640 

Lys Thr Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val 
645 650 655 

10 Gly Met Asp Val Leu Thr Leu Ala lie Trp Gin lie Val Asp Pro Leu 
660 665 670 

His Arg Thr lie Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp lie 
675 680 685 

15 

Asp Val Ser lie Leu Pro Gin Leu Glu His Cys Ser Ser Arg Lys Met 
690 695 700 

Asn Thr Trp Leu Gly lie Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu 
20 705 710 715 720 

Leu Gly lie Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys 
725 730 735 

25 lie Asn Asp His Arg Ala Val Gly Met Ala lie Tyr Asn Val Ala Val 
740 745 750 

Leu Cys Leu lie Thr Ala Pro Val Thr Met lie Leu Ser Ser Gin Gin 
755 760 765 

30 

Asp Ala Ala Phe Ala Phe Ala Ser Leu Ala lie Val Phe Ser Ser Tyr 
770 775 780 

lie Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg Leu lie Thr 
35 785 790 795 800 

Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly Ser 
805 810 815 

40 Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu 
820 825 830 

Asn Arg Glu Leu Glu Lys lie lie Ala Glu Lys Glu Glu Arg Val Ser 
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835 



840 



845 



Glu Leu Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arcj 
850 855 860 

His Pro Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro Arg Gly 
865 870 875 

Pro Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val Hi;; 

885 890 89!: 

Leu Tyr Lys 



(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA to mRNA 

( i i i ) HYPOTHETICAL : NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION : 1 . .294 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO : 72: 

ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CC. 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pr 
15 10 1 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CI<. 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gl 
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20 25 30 ~ 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
5 35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 

Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

10 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

15 TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 288 
Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

GTG TGA TCCAGGCCAA GCCACCAAGT AC CT ATATG A GCTGCTCTAC AACGACCCTA 344 
20 Val 



TCAAGATCAT CCTTATGCCT GGCTGCAGCT CTGTCTCCAC GCTGGTGGCT GAGGCTGCTA 404 

25 GGATGTGGAA CCTCATTGTG CTTTCCTATG GCTCCAGCTC ACCAGCCCTG TCAAACCGGC 464 

AGCGTTTCCC CACTTTCTTC CGAACGCACC CATCAGCCAC ACTCCACAAC CCTACCCGCG 52 4 

TGAAACTCTT TGAAAAGTGG GGCTGGAAGA AGATTGCTAC CATCCAGCAG ACCACTGAGG 5 84 

30 

TCTTCACTTC GACTCTGGAC GACCTGGAGG AACGAGTGAA GGAGGCTGGA ATTGAGATTA 644 

CTTTCCGCCA GAGTTTCTTC TCAGATCCAG CTGTGCCCGT CAAAAACCTG AAGCGCCAGG 704 

35 ATGCCCGAAT CATCGTGGGA CTTTTCTATG AGACTGAAGC CCGGAAAGTT TTTTGTGAGG 7 64 

TGTACAAGGA GCGTCTCTTT GGGAAGAAGT ACGTCTGGTT CCTCATTGGG TGGTATGCTG 824 

ACAATTGGTT CAAGATCTAC GACCCTTCTA TCAACTGCAC AGTGGATGAG ATGACTGAGG 884 

CGGTGGAGGG CCACATCACA ACTGAGATTG TCATGCTGAA TCCTGCCAAT ACCCGCAGCA 944 

TTTCCAACAT GACATCCCAG GAATTTGTGG AGAAACTAAC CAAGCGACTG AAAAGACACC 1004 



40 
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CTGAGGAGAC AGGAGGCTTC CAGGAGGCAC CGCTGGCCTA TGATGCCATC TGGGCCTTGG 1064 

CACTGGCCCT GAACAAGACA TCTGGAGGAG GCGGCCGTTC TGGTGTGCGC CTGGAGGACT 1124 

TCAACTACAA CAACCAGACC ATTACCGACC AAATCTACCG GGCAATGAAC TCTTCGTCCT 1184 

TTGAGGGTGT CTCTGGCCAT GTGGTGTTTG ATGCCAGCGG CTCTCGGATG GCATGGACGC 1244 

10 TTATCGAGCA GCTTCAGGGT GGCAGCTACA AGAAGATTGG CTACTATGAC AGCACCAAGG 13 04 

ATGATCTTTC CTGGTCCAAA ACAGATAAAT GGATTGGAGG GTCCCCCCCA GCTGACCAGA 13 64 

CCCTGGTCAT CAAGACATTC CGCTTCCTGT CACAGAAACT CTTTATCTCC GTCTCAGTTC 1424 

TCTCCAGCCT GGGCATTGTC CTAGCTGTTG TCTGTCTGTC CTTTAACATC TACAACTCAC 1484 

ATGTCCGTTA TATCCAGAAC TCACAGCCCA AC CTGAAC AA CCTGACTGCT GTGGGCTGCT 1544 

CACTGGCTTT AGCTGCTGTC TTCCCCCTGG GGCTCGATGG TTACCACATT GGGAGGAACC 1604 

AGTTTCCTTT CGTCTGCCAG GCCCGCCTCT GGCTCCTGGG CCTGGGCTTT AGTCTGGGCT 1664 

ACGGTTCCAT GTTCACCAAG ATTTGGTGGG TCCACACGGT CTTCACAAAG AAGGAAGAAA 1724 

AGAAGGAGTG GAGGAAGACT CTGGAACCCT GGAAGCTGTA TGCCACAGTG GGCCTGCTGG 1784 

TGGGCATGGA TGTCCTCACT CTCGCCATCT GGCAGATCGT GGACCCTCTG CACCGGACCA 1844 

30 TTGAGACATT TGCCAAGGAG GAACCTAAGG AAGATATTGA CGTCTCTATT CTGCCCCAGC 1904 

TGGAGCATTG CAGCTCCAGG AAGATGAATA CATGGCTTGG CATTTTCTAT GGTTACAAGG 1964 

GGCTGCTGCT GCTGCTGGGA ATCTTCCTTG CTTATGAGAC CAAGAGTGTG TCCACTGAGA 2024 

35 

AGATCAATGA TCACCGGGCT GTGGGCATGG CTATCTACAA TGTGGCAGTC CTGTGCCTCA 2084 

TCACTGCTCC TGTCACCATG ATTCTGTCCA GCCAGCAGGA TGCAGCCTTT GCCTTTGCCT 2144 

40 CTCTTGCCAT AGTTTTCTCC TCCTATATCA CTCTTGTTGT GCTCTTTGTG CCCAAGATGC 2204., 

GCAGGCTGAT CACCCGAGGG GAATGGCAGT CGGAGGCGCA GGACACCATG AAGACAGGGT 2264 



25 



HI 865-1 WO SEQ 



134 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 PCT/SE98/01947 

CATCGACCAA CAACAACGAG GAGGAGAAGT CCCGGCTGTT GGAGAAGGAG AACCGTGAAC 23~Z4 

TGGAAAAGAT CATTGCTGAG AAAGAGGAGC GTGTCTCTGA ACTGCGCCAT CAACTCCAGT 2384 

5 CTCGGCAGCA GCTCCGCTCC CGGCGCCACC CACCGACACC CCCAGAACCC TCTGGGGGCC 2444 

TGCCCAGGGG ACCCCCTGAG CCCCCCGACC GGCTTAGCTG TGATGGGAGT CGAGTGCATT 2504 

TGCTTTATAA GTGA 2518 

10 

(2) INFORMATION FOR SEQ ID NO : 73: 

(i) SEQUENCE CHARACTERISTICS: 
15 (A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



20 



25 



(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73: 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
1 5 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
30 3 5 4 0 4 5 

Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

35 Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 



40 



Val 
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(2) INFORMATION FOR SEQ ID NO : 74: 

(i) SEQUENCE CHARACTERISTICS: 
5 (A) LENGTH: 2679 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

(D) TOPOLOGY: linear 

10 <ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

15 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 
20 (A) NAME / KEY : CDS 

(B) LOCATION : 1 . .2 679 

(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 

25 ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 
30 Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
lie His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly Leu Thr Arg 
35 35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Glu 
50 55 60 



40 



TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 
Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val *Arg Lys 
65 70 75 80 
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TGC CTG GCC AAC GGC TCC TGG ACA GAT 
Cys Leu Ala Asn Gly Ser Trp Thr Asp 
85 

5 

GAA CGG CGC GCA GTG TAC ATC GGG GCA 
Glu Arg Arg Ala Val Tyr lie Gly Ala 
100 105 



PCT/SE98/01947 

ATG GAC ACA CCC AGC CGC TGT 2 88 

Met Asp Thr Pro Ser Arg Cys 
90 95 

CTG TTT CCC ATG AGC GGG GGC 336 
Leu Phe Pro Met Ser Gly Gly 
110 



10 TGG CCA GGG GGC CAG GCC TGC CAG CCC GCG GTG GAG ATG GCG CTG GAG 3 84 

Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
115 120 125 

GAC GTG AAT AGC CGC AGG GAC ATC CTG CCG GAC TAT GAG CTC AAG CTC 432 
15 Asp Val Asn Ser Arg Arg Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu 
130 135 140 

ATC CAC CAC GAC AGC AAG TGT GAT CCA GGC CAA GCC ACC AAG TAC CTA 480 
lie His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
20 145 150 155 160 

TAT GAG CTG CTC TAC AAC GAC CCT ATC AAG ATC ATC CTT ATG CCT GGC 528 
Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu Met Pro Gly 
165 170 175 

25 

TGC AGC TCT GTC TCC ACG CTG GTG GCT GAG GCT GCT AGG ATG TGG AAC 576 
Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
180 185 190 

30 CTC ATT GTG CTT TCC TAT GGC TCC AGC TCA CCA GCC CTG TCA AAC CGG 624 
Leu lie Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
195 200 205 

CAG CGT TTC CCC ACT TTC TTC CGA ACG CAC CCA TCA GCC ACA CTC CAC 672 
35 Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr Leu His 
210 215 220 

AAC CCT ACC CGC GTG AAA CTC TTT GAA AAG TGG GGC TGG AAG AAG ATT 720 
Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie 
40 225 230 235 240 

GCT ACC ATC CAG CAG ACC ACT GAG GTC TTC ACT TCG ACT CTG* GAC GAC 768 
Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 
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245 250 255 " 

CTG GAG GAA CGA GTG AAG GAG GCT GGA ATT GAG ATT ACT TTC CGC CAG 816 
Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie Thr Phe Arg Gin 
5 260 265 270 

AGT TTC TTC TCA GAT CCA GCT GTG CCC GTC AAA AAC CTG AAG CGC CAG 864 

Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
275 280 285 

10 

GAT GCC CGA ATC ATC GTG GGA CTT TTC TAT GAG ACT GAA GCC CGG AAA 912 

Asp Ala Arg lie He Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
290 295 300 

15 GTT TTT TGT GAG GTG TAC AAG GAG CGT CTC TTT GGG AAG AAG TAC GTC 960 
Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
305 310 315 320 

TGG TTC CTC ATT GGG TGG TAT GCT GAC AAT TGG TTC AAG ATC TAC GAC 1008 
20 Trp Phe Leu He Gly Trp Tyr Ala Asp Asn Trp Phe Lys He Tyr Asp 

325 330 335 

CCT TCT ATC AAC TGC ACA GTG GAT GAG ATG ACT GAG GCG GTG GAG GGC 1056 
Pro Ser He Asn Cys Thr Val Asp Glu Met Thr Glu Ala Val Glu Gly 
25 340 345 350 

CAC ATC ACA ACT GAG ATT GTC ATG CTG AAT CCT GCC AAT ACC CGC AGC 1104 
His He Thr Thr Glu He Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
355 360 365 



30 



35 



40 



ATT TCC AAC ATG ACA TCC CAG GAA TTT GTG GAG AAA CTA ACC AAG CGA 1152 
He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
370 375 380 

CTG AAA AG A CAC CCT GAG GAG ACA GGA GGC TTC CAG GAG GCA CCG CTG 1200 
Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
385 390 395 400 

GCC TAT GAT GCC ATC TGG GCC TTG GCA CTG GCC CTG AAC AAG ACA TCT 1248 
Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser u 
405 410 415 

GGA GGA GGC GGC CGT TCT GGT GTG CGC CTG GAG GAC TTC AAC TAC AAC 1296 
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Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
420 425 430 

AAC CAG ACC ATT ACC GAC CAA ATC TAC CGG GCA ATG AAC TCT TCG TCC 1344 
5 Asn Gin Thr lie Thr Asp Gin lie Tyr Arg Ala Met Asn Ser Ser Ser 
435 440 445 

TTT GAG GGT GTC TCT GGC CAT GTG GTG TTT GAT GCC AGC GGC TCT CGG 1392 
Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
10 450 455 460 

ATG GCA TGG ACG CTT ATC GAG CAG CTT CAG GGT GGC AGC TAC AAG AAG 1440 

Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 

465 470 475 480 

15 

ATT GGC TAC TAT GAC AGC ACC AAG GAT GAT CTT TCC TGG TCC AAA ACA 1488 

lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
485 490 495 

20 GAT AAA TGG ATT GGA GGG TCC CCC CCA GCT GAC CAG ACC CTG GTC ATC 153 6 

Asp Lys Trp lie Gly Gly Ser Pro Pro Ala Asp Gin Thr Leu Val lie 
500 505 510 

AAG ACA TTC CGC TTC CTG TCA CAG AAA CTC TTT ATC TCC GTC TCA GTT 1584 
25 Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe lie Ser Val Ser Val 
515 520 525 

CTC TCC AGC CTG GGC ATT GTC CTA GCT GTT GTC TGT CTG TCC TTT AAC 1632 
Leu Ser Ser Leu Gly lie Val Leu Ala Val Val Cys Leu Ser Phe Asn 
30 5 3 0 5 3 5 5 4 0 

ATC TAC AAC TCA CAT GTC CGT TAT ATC CAG AAC TCA CAG CCC AAC CTG 1680 
lie Tyr Asn Ser His Val Arg Tyr lie Gin Asn Ser Gin Pro Asn Leu 
545 550 555 560 

35 

AAC AAC CTG ACT GCT GTG GGC TGC TCA CTG GCT TTA GCT GCT GTC TTC 1728 
Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
565 570 575 

40 CCC CTG GGG CTC GAT GGT TAC CAC ATT GGG AGG AAC CAG TTT CCT TTC 1776 
Pro Leu Gly Leu Asp Gly Tyr His lie Gly Arg Asn Gin Phe Pro Phe 
580 585 590 
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GTC TGC CAG GCC CGC CTC 

Val Cys Gin Ala Arg Leu 
595 

5 TAC GGT TCC ATG TTC ACC 

Tyr Gly Ser Met Phe Thr 
610 

AAG AAG GAA GAA AAG AAG 

10 Lys Lys Glu Glu Lys Lys 

625 630 



TGG CTC CTG GGC CTG GGC 
Trp Leu Leu Gly Leu Gly 
600 

AAG ATT TGG TGG GTC CAC 
Lys lie Trp Trp Val His 
615 620 

GAG TGG AGG AAG ACT CTG 
Glu Trp Arg Lys Thr Leu 
635 



TTT AGT CTG GGC 18T4 

Phe Ser Leu Gly 

605 

ACG GTC TTC ACA 1872 
Thr Val Phe Thr 

GAA CCC TGG AAG 1920 
Glu Pro Trp Lys 
640 



CTG TAT GCC ACA GTG GGC CTG CTG GTG GGC ATG GAT GTC CTC ACT CTC 1968 
Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
15 645 650 655 



GCC ATC TGG CAG ATC GTG 
Ala He Trp Gin He Val 
660 

20 

GCC AAG GAG GAA CCT AAG 
Ala Lys Glu Glu Pro Lys 
675 

25 CTG GAG CAT TGC AGC TCC 
Leu Glu His Cys Ser Ser 
690 



GAC CCT CTG CAC CGG ACC 
Asp Pro Leu His Arg Thr 
665 

GAA GAT ATT GAC GTC TCT 
Glu Asp He Asp Val Ser 
680 

AGG AAG ATG AAT ACA TGG 
Arg Lys Met Asn Thr Trp 
695 700 



ATT GAG ACA TTT 2016 

He Glu Thr Phe 
670 

ATT CTG CCC CAG 2064 

He Leu Pro Gin 

685 

CTT GGC ATT TTC 2112 

Leu Gly He Phe 



TAT GGT TAC AAG GGG CTG CTG CTG 

30 Tyr Gly Tyr Lys Gly Leu Leu Leu 
705 710 

GAG ACC AAG AGT GTG TCC ACT GAG 

Glu Thr Lys Ser Val Ser Thr Glu 
35 725 



CTG CTG GGA ATC TTC CTT GCT TAT 2160 
Leu Leu Gly He Phe Leu Ala Tyr 
715 720 

AAG ATC AAT GAT CAC CGG GCT GTG 2208 
Lys He Asn Asp His Arg Ala Val 
730 735 



GGC ATG GCT ATC TAC AAT 
Gly Met Ala He Tyr Asn 
740 

40 

GTC ACC ATG ATT CTG TCC 
Val Thr Met He Leu Ser 
755 



GTG GCA GTC CTG TGC CTC 
Val Ala Val Leu Cys Leu 
745 

AGC CAG CAG GAT GCA GCC 
Ser Gin Gin Asp Ala Ala 
760 



ATC ACT GCT CCT 2256 
He Thr Ala Pro 
750 

TTT GCC TTT GCC 2304 

Phe Ala* Phe Ala 

765 
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TCT CTT GCC ATA GTT TTC TCC TCC TAT ATC ACT CTT GTT GTG CTC TTT 2352 
Ser Leu Ala lie Val Phe Ser Ser Tyr lie Thr Leu Val Val Leu Phe 
770 775 780 

5 

GTG CCC AAG ATG CGC AGG CTG ATC ACC CGA GGG GAA TGG CAG TCG GAG 2400 
Val Pro Lys Met Arg Arg Leu lie Thr Arg Gly Glu Trp Gin Ser Glu 
785 790 795 800 

10 GCG CAG GAC ACC ATG AAG AC A GGG TCA TCG ACC AAC AAC AAC GAG GAG 2448 
Ala Gin Asp Thr Met Lys Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu 
805 810 815 

GAG AAG TCC CGG CTG TTG GAG AAG GAG AAC CGT GAA CTG GAA AAG ATC 2496 
15 Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu Leu Glu Lys lie 
820 825 830 

ATT GCT GAG AAA GAG GAG CGT GTC TCT GAA CTG CGC CAT CAA CTC CAG 2544 
lie Ala Glu Lys Glu Glu Arg Val Ser Glu Leu Arg His Gin Leu Gin 
20 835 840 845 

TCT CGG CAG CAG CTC CGC TCC CGG CGC CAC CCA CCG ACA CCC CCA GAA 2592 

Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro Pro Thr Pro Pro Glu 
850 855 860 

25 

CCC TCT GGG GGC CTG CCC AGG GGA CCC CCT GAG CCC CCC GAC CGG CTT 2640 

Pro Ser Gly Gly Leu Pro Arg Gly Pro Pro Glu Pro Pro Asp Arg Leu 
865 870 875 880 

30 AGC TGT GAT GGG AGT CGA GTG CAT TTG CTT TAT AAG TGA 2679 
Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr Lys * 
885 890 



35 (2) INFORMATION FOR SEQ ID NO : 75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 92 amino acids 

(B) TYPE: amino acid 
40 (D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 



HI 865-1 WO SEQ 



141 

SUBSTITUTE SHEET (RULE 26) 



WO 99/21890 PCT/SE98/01947 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75: ~ 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

5 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
10 35 40 45 

Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

15 Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

20 

Glu Arg Arg Ala Val Tyr lie Gly Ala Leu Phe Pro Met Ser Gly Gly 
100 105 110 

Trp Pro Gly Gly Gin Ala Cys Gin Pro Ala Val Glu Met Ala Leu Glu 
25 115 120 125 

Asp Val Asn Ser Arg Arg Asp He Leu Pro Asp Tyr Glu Leu Lys Leu 
130 135 140 

30 He His His Asp Ser Lys Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu 
145 150 155 160 

Tyr Glu Leu Leu Tyr Asn Asp Pro He Lys He He Leu Met Pro Gly 
165 170 175 

35 

Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg Met Trp Asn 
180 185 190 

Leu He Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg 
40 195 200 205 

Gin Arg Phe Pro Thr Phe Phe Arg Thr His Pro Ser Ala Thr* Leu His 
210 215 220 
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Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp Lys Lys lie 
225 230 235 240 

5 Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr Leu Asp Asp 

245 250 255 

Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie Thr Phe Arg Gin 
260 265 270 

10 

Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu Lys Arg Gin 
275 280 285 

Asp Ala Arg lie lie Val Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys 
15 290 295 300 

Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val 
305 310 315 320 

20 Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys lie Tyr Asp 

325 330 335 

Pro Ser lie Asn Cys Thr Val Asp Glu Met Thr Glu Ala Val Glu Gly 
340 345 350 

25 

His lie Thr Thr Glu lie Val Met Leu Asn Pro Ala Asn Thr Arg Ser 
355 360 365 

lie Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu Thr Lys Arg 
30 370 375 380 

Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu 
385 390 395 400 

35 Ala Tyr Asp Ala lie Trp Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser 

405 410 415 

Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn 
420 425 430 

40 

Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala Met Asn Ser Ser Ser 
435 440 445 
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Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser Gly Ser Arg 
450 455 460 

Met Ala Trp Thr Leu lie Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys 
5 465 470 475 480 

lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr 
485 490 495 

10 Asp Lys Trp lie Gly Gly Ser Pro Pro Ala Asp Gin Thr Leu Val lie 
500 505 510 

Lys Thr Phe Arg Phe Leu Ser Gin Lys Leu Phe lie Ser Val Ser Val 
515 520 525 

15 

Leu Ser Ser Leu Gly lie Val Leu Ala Val Val Cys Leu Ser Phe Asn 
530 535 540 

lie Tyr Asn Ser His Val Arg Tyr lie Gin Asn Ser Gin Pro Asn Leu 
20 545 550 555 560 

Asn Asn Leu Thr Ala Val Gly Cys Ser Leu Ala Leu Ala Ala Val Phe 
565 570 575 

25 Pro Leu Gly Leu Asp Gly Tyr His lie Gly Arg Asn Gin Phe Pro Phe 
580 585 590 

Val Cys Gin Ala Arg Leu Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly 
595 600 605 

30 

Tyr Gly Ser Met Phe Thr Lys lie Trp Trp Val His Thr Val Phe Thr 
610 615 620 

Lys Lys Glu Glu Lys Lys Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys 
35 625 630 635 640 

Leu Tyr Ala Thr Val Gly Leu Leu Val Gly Met Asp Val Leu Thr Leu 
645 650 655 

40 Ala lie Trp Gin lie Val Asp Pro Leu His Arg Thr lie Glu Thr Phe 
660 665 670 

Ala Lys Glu Glu Pro Lys Glu Asp lie Asp Val Ser lie Leu Pro Gin 
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675 680 685 ~ 

Leu Glu His Cys Ser Ser Arg Lys Met Asn Thr Trp Leu Gly lie Phe 
690 695 700 

5 

Tyr Gly Tyr Lys Gly Leu Leu Leu Leu Leu Gly He Phe Leu Ala Tyr 
705 710 715 720 

Glu Thr Lys Ser Val Ser Thr Glu Lys He Asn Asp His Arg Ala Val 
10 725 730 735 

Gly Met Ala He Tyr Asn Val Ala Val Leu Cys Leu He Thr Ala Pro 
740 745 750 

15 Val Thr Met He Leu Ser Ser Gin Gin Asp Ala Ala Phe Ala Phe Ala 
755 760 765 

Ser Leu Ala He Val Phe Ser Ser Tyr He Thr Leu Val Val Leu Phe 
770 775 780 

20 

Val Pro Lys Met Arg Arg Leu He Thr Arg Gly Glu Trp Gin Ser Glu 
785 790 795 800 

Ala Gin Asp Thr Met Lys Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu 
25 805 810 815 

Glu Lys Ser Arg Leu Leu Glu Lys Glu Asn Arg Glu Leu Glu Lys He 
820 825 830 

30 He Ala Glu Lys Glu Glu Arg Val Ser Glu Leu Arg His Gin Leu Gin 
835 840 845 

Ser Arg Gin Gin Leu Arg Ser Arg Arg His Pro Pro Thr Pro Pro Glu 
850 855 860 

35 

Pro Ser Gly Gly Leu Pro Arg Gly Pro Pro Glu Pro Pro Asp Arg Leu 
865 870 875 880 

Ser Cys Asp Gly Ser Arg Val His Leu Leu Tyr Lys 
40 8 8 5 8 9 0 



(2) INFORMATION FOR SEQ ID NO : 76: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2661 base pairs 

(B) TYPE: nucleic acid 

5 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

10 (iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 
15 (A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY: CDS 

(B) LOCATION: 1 . .2 661 
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20 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 



ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
25 1 5 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

35 GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 
40 Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 288 
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Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

GTG AAT CGA ACG CCA CAC TCA GAA CGG CGC GCA GTG TAC ATC GGG GCA 33 6 

5 Val Asn Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr lie Gly Ala 
100 105 110 

CTG TTT CCC GCG GTG GAG ATG GCG CTG GAG GAC GTG AAT AGC CGC AGG 3 84 

Leu Phe Pro Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg 
10 115 120 125 

GAC ATC CTG CCG GAC TAT GAG CTC AAG CTC ATC CAC CAC GAC AGC AAG 432 

Asp lie Leu Pro Asp Tyr Glu Leu Lys Leu lie His His Asp Ser Lys 
130 135 140 

15 

TGT GAT CCA GGC CAA GCC ACC AAG TAC CTA TAT GAG CTG CTC TAC AAC 480 

Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn 

145 150 155 160 

20 GAC CCT ATC AAG ATC ATC CTT ATG CCT GGC TGC AGC TCT GTC TCC ACG 528 
Asp Pro lie Lys lie lie Leu Met Pro Gly Cys Ser Ser Val Ser Thr 
165 170 175 

CTG GTG GCT GAG GCT GCT AGG ATG TGG AAC CTC ATT GTG CTT TCC TAT 576 
25 Leu Val Ala Glu Ala Ala Arg Met Trp Asn Leu lie Val Leu Ser Tyr 
180 185 190 

GGC TCC AGC TCA CCA GCC CTG TCA AAC CGG CAG CGT TTC CCC ACT TTC 624 
Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe 
30 195 200 205 

TTC CGA ACG CAC CCA TCA GCC ACA CTC CAC AAC CCT ACC CGC GTG AAA 672 
Phe Arg Thr His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys 
210 215 220 



35 



40 



CTC TTT GAA AAG TGG GGC TGG AAG AAG ATT GCT ACC ATC CAG CAG ACC 720 
Leu Phe Glu Lys Trp Gly Trp Lys Lys lie Ala Thr lie Gin Gin Thr 
225 230 235 240 

ACT GAG GTC TTC ACT TCG ACT CTG GAC GAC CTG GAG GAA CGA GTG AAG 768, 
Thr Glu Val Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys 
245 250 *255 
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GAG GCT GGA ATT 
Glu Ala Gly lie 
260 

5 GCT GTG CCC GTC 
Ala Val Pro Val 
275 

GGA CTT TTC TAT 
10 Gly Leu Phe Tyr 
290 

AAG GAG CGT CTC 
Lys Glu Arg Leu 
15 305 

TAT GCT GAC AAT 
Tyr Ala Asp Asn 

20 

GTG GAT GAG ATG 
Val Asp Glu Met 
340 

25 GTC ATG CTG AAT 
Val Met Leu Asn 
355 

CAG GAA TTT GTG 
30 Gin Glu Phe Val 
370 

GAG ACA GGA GGC 
Glu Thr Gly Gly 
35 3 85 

GCC TTG GCA CTG 
Ala Leu Ala Leu 

40 

GGT GTG CGC CTG 
Gly Val Arg Leu 
420 



GAG ATT ACT TTC CGC 
Glu lie Thr Phe Arg 
265 

AAA AAC CTG AAG CGC 
Lys Asn Leu Lys Arg 
280 

GAG ACT GAA GCC CGG 
Glu Thr Glu Ala Arg 
295 

TTT GGG AAG AAG TAC 
Phe Gly Lys Lys Tyr 
310 

TGG TTC AAG ATC TAC 
Trp Phe Lys lie Tyr 
325 

ACT GAG GCG GTG GAG 
Thr Glu Ala Val Glu 
345 

CCT GCC AAT ACC CGC 
Pro Ala Asn Thr Arg 
360 

GAG AAA CTA ACC AAG 
Glu Lys Leu Thr Lys 
375 

TTC CAG GAG GCA CCG 
Phe Gin Glu Ala Pro 
390 

GCC CTG AAC AAG ACA 
Ala Leu Asn Lys Thr 
405 

GAG GAC TTC AAC TAC 
Glu Asp Phe Asn Tyr 
425 



CAG AGT TTC TTC TCA 
Gin Ser Phe Phe Ser 

270 

.> 

CAG GAT GCC CGA ATC 
Gin Asp Ala Arg lie 
285 

AAA GTT TTT TGT GAG 
Lys Val Phe Cys Glu 
300 

GTC TGG TTC CTC ATT 
Val Trp Phe Leu lie 
315 

GAC CCT TCT ATC AAC 
Asp Pro Ser lie Asn 
330 

GGC CAC ATC ACA ACT 
Gly His lie Thr Thr 
350 

AGC ATT TCC AAC ATG 
Ser lie Ser Asn Met 
365 

CGA CTG AAA AGA CAC 
Arg Leu Lys Arg His 
380 

CTG GCC TAT GAT GCC 
Leu Ala Tyr Asp Ala 
395 

TCT GGA GGA GGC GGC 
Ser Gly Gly Gly Gly 
410 

AAC AAC CAG ACC ATT 
Asn Asn Gin Thr lie 
430 



PCT/SE98/01947 

GAT CCA 8T6 
Asp Pro 

ATC GTG 864 
lie Val 

GTG TAC 912 
Val Tyr 

GGG TGG 960 
Gly Trp 
320 

TGC ACA 1008 

Cys Thr 

335 

GAG ATT 1056 
Glu lie 

ACA TCC 1104 
Thr Ser 

CCT GAG 1152 
Pro Glu 

ATC TGG 1200 
lie Trp 
400 

CGT TCT 1248 

Arg Ser 

415 

ACC GAC 1296 
Thr Asp 
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CAA ATC TAC CGG GCA ATG AAC TCT TCG TCC TTT GAG GGT GTC TCT GGC 13 44 

Gin He Tyr Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly 
435 440 445 

CAT GTG GTG TTT GAT GCC AGC GGC TCT CGG ATG GCA TGG ACG CTT ATC 13 92 

His Val Val Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He 
450 455 460 

GAG CAG CTT CAG GGT GGC AGC TAC AAG AAG ATT GGC TAC TAT GAC AGC 1440 
Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser 
465 470 475 480 

ACC AAG GAT GAT CTT TCC TGG TCC AAA ACA GAT AAA TGG ATT GGA GGG 1488 
Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp lie Gly Gly 
485 490 495 

TCC CCC CCA GCT GAC CAG ACC CTG GTC ATC AAG ACA TTC CGC TTC CTG 153 6 

Ser Pro Pro Ala Asp Gin Thr Leu Val He Lys Thr Phe Arg Phe Leu 
500 505 510 

TCA CAG AAA CTC TTT ATC TCC GTC TCA GTT CTC TCC AGC CTG GGC ATT 1584 
Ser Gin Lys Leu Phe He Ser Val Ser Val Leu Ser Ser Leu Gly He 
515 520 525 

GTC CTA GCT GTT GTC TGT CTG TCC TTT AAC ATC TAC AAC TCA CAT GTC 1632 
Val Leu Ala Val Val Cys Leu Ser Phe Asn He Tyr Asn Ser His Val 
530 535 540 

CGT TAT ATC CAG AAC TCA CAG CCC AAC CTG AAC AAC CTG ACT GCT GTG 1680 
Arg Tyr He Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val 
545 550 555 560 

GGC TGC TCA CTG GCT TTA GCT GCT GTC TTC CCC CTG GGG CTC GAT GGT 1728 
Gly Cys Ser Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly 
565 570 575 

TAC CAC ATT GGG AGG AAC CAG TTT CCT TTC GTC TGC CAG GCC CGC CTC 1776 
Tyr His He Gly Arg Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu 
580 585 590 

TGG CTC CTG GGC CTG GGC TTT AGT CTG GGC TAC GGT TCC ATG* TTC ACC 1824 
Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr 
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595 600 605 

AAG ATT TGG TGG GTC CAC ACG GTC TTC ACA AAG AAG GAA GAA AAG AAG 1872 

Lys lie Trp Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys 
5 610 615 620 



GAG TGG AGG AAG ACT CTG GAA CCC TGG AAG CTG TAT GCC ACA GTG GGC 1920 
Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly 
625 630 635 640 

CTG CTG GTG GGC ATG GAT GTC CTC ACT CTC GCC ATC TGG CAG ATC GTG 1968 
Leu Leu Val Gly Met Asp Val Leu Thr Leu Ala lie Trp Gin lie Val 
645 650 655 



15 GAC CCT CTG CAC CGG ACC ATT GAG ACA TTT GCC AAG GAG GAA CCT AAG 2016 

Asp Pro Leu His Arg Thr lie Glu Thr Phe Ala Lys Glu Glu Pro Lys 
660 665 670 

GAA GAT ATT GAC GTC TCT ATT CTG CCC CAG CTG GAG CAT TGC AGC TCC 2064 

20 Glu Asp lie Asp Val Ser lie Leu Pro Gin Leu Glu His Cys Ser Ser 
675 680 685 

AGG AAG ATG AAT ACA TGG CTT GGC ATT TTC TAT GGT TAG AAG GGG CTG 2112 

Arg Lys Met Asn Thr Trp Leu Gly lie Phe Tyr Gly Tyr Lys Gly Leu 
25 690 695 700 



CTG CTG CTG CTG GGA ATC TTC CTT GCT TAT GAG ACC AAG AGT GTG TCC 2160 

Leu Leu Leu Leu Gly lie Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser 

705 710 715 720 

ACT GAG AAG ATC AAT GAT CAC CGG GCT GTG GGC ATG GCT ATC TAC AAT 2208 

Thr Glu Lys lie Asn Asp His Arg Ala Val Gly Met Ala lie Tyr Asn 
725 730 735 



35 GTG GCA GTC CTG TGC CTC ATC ACT GCT CCT GTC ACC ATG ATT CTG TCC 2256 

Val Ala Val Leu Cys Leu He Thr Ala Pro Val Thr Met He Leu Ser 

740 745 750 

AGC CAG CAG GAT GCA GCC TTT GCC TTT GCC TCT CTT GCC ATA GTT TTC 23 04 

40 Ser Gin Gin Asp Ala Ala Phe Ala Phe Ala Ser Leu Ala He Val Phe 
755 760 765 

TCC TCC TAT ATC ACT CTT GTT GTG CTC TTT GTG CCC AAG ATG CGC AGG 23 52 
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Ser Ser Tyr lie Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg 
770 775 780 

CTG ATC ACC CGA GGG GAA TGG CAG TCG GAG GCG CAG GAC ACC ATG AAG 2400 
Leu He Thr Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys 
785 790 795 800 

ACA GGG TCA TCG ACC AAC AAC AAC GAG GAG GAG AAG TCC CGG CTG TTG 2448 
Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu 
805 810 815 

GAG AAG GAG AAC CGT GAA CTG GAA AAG ATC ATT GCT GAG AAA GAG GAG 2496 
Glu Lys Glu Asn Arg Glu Leu Glu Lys lie He Ala Glu Lys Glu Glu 
820 825 830 

CGT GTC TCT GAA CTG CGC CAT CAA CTC CAG TCT CGG CAG CAG CTC CGC 2544 
Arg Val Ser Glu Leu Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg 
835 840 845 

TCC CGG CGC CAC CCA CCG ACA CCC CCA GAA CCC TCT GGG GGC CTG CCC 2592 
Ser Arg Arg His Pro Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro 
850 855 860 

AGG GGA CCC CCT GAG CCC CCC GAC CGG CTT AGC TGT GAT GGG AGT CGA 2640 
Arg Gly Pro Pro Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg 
865 870 875 880 

GTG CAT TTG CTT TAT AAG TGA 2661 
Val His Leu Leu Tyr Lys 
885 



(2) INFORMATION FOR SEQ ID NO : 77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 886 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
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Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly — 
15 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

He His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 . 

Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Glu 
50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

Val Asn Arg Thr Pro His Ser Glu Arg Arg Ala Val Tyr He Gly Ala 
100 105 HO 

Leu Phe Pro Ala Val Glu Met Ala Leu Glu Asp Val Asn Ser Arg Arg 
115 120 125 

Asp He Leu Pro Asp Tyr Glu Leu Lys Leu He His His Asp Ser Lys 
130 135 140 

Cys Asp Pro Gly Gin Ala Thr Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn 
145 150 155 160 

Asp Pro He Lys He He Leu Met Pro Gly Cys Ser Ser Val Ser Thr 
165 170 175 

Leu Val Ala Glu Ala Ala Arg Met Trp Asn Leu He Val Leu Ser Tyr 
180 185 190 

Gly Ser Ser Ser Pro Ala Leu Ser Asn Arg Gin Arg Phe Pro Thr Phe 
195 200 205 

Phe Arg Thr His Pro Ser Ala Thr Leu His Asn Pro Thr Arg Val Lys 
210 215 220 



Leu 



Phe Glu Lys Trp Gly Trp Lys Lys He Ala Thr He Gin Gin Thr 
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230 



235 



PCT/SE98/01947 

240 



Thr Glu Val Phe Thr Ser Thr Leu Asp Asp Leu Glu Glu Arg Val Lys 
245 250 255 

Glu Ala Gly lie Glu He Thr Phe Arg Gin Ser Phe Phe Ser Asp Pro 
260 265 270 

Ala Val Pro Val Lys Asn Leu Lys Arg Gin Asp Ala Arg He He Val 
275 280 285 

Gly Leu Phe Tyr Glu Thr Glu Ala Arg Lys Val Phe Cys Glu Val Tyr 
290 295 300 

Lys Glu Arg Leu Phe Gly Lys Lys Tyr Val Trp Phe Leu He Gly Trp 
305 310 315 320 

Tyr Ala Asp Asn Trp Phe Lys He Tyr Asp Pro Ser He Asn Cys Thr 
325 330 335 

Val Asp Glu Met Thr Glu Ala Val Glu Gly His He Thr Thr Glu He 
340 345 350 

Val Met Leu Asn Pro Ala Asn Thr Arg Ser He Ser Asn Met Thr Ser 
355 360 365 

Gin Glu Phe Val Glu Lys Leu Thr Lys Arg Leu Lys Arg His Pro Glu 
370 375 380 

Glu Thr Gly Gly Phe Gin Glu Ala Pro Leu Ala Tyr Asp Ala lie Trp 
385 390 395 400 

Ala Leu Ala Leu Ala Leu Asn Lys Thr Ser Gly Gly Gly Gly Arg Ser 
405 410 415 

Gly Val Arg Leu Glu Asp Phe Asn Tyr Asn Asn Gin Thr He Thr Asp 
420 425 430 

Gin He Tyr Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly 
435 440 445 

His Val Val Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He 
450 455 460 
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Glu Gin Leu Gin Gly Gly Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser 
465 470 475 480 

5 Thr Lys Asp Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp lie Gly Gly 

485 490 495 

Ser Pro Pro Ala Asp Gin Thr Leu Val lie Lys Thr Phe Arg Phe Leu 
500 505 510 

10 

Ser Gin Lys Leu Phe lie Ser Val Ser Val Leu Ser Ser Leu Gly lie 
515 520 525 

Val Leu Ala Val Val Cys Leu Ser Phe Asn lie Tyr Asn Ser His Val 
15 530 535 540 

Arg Tyr lie Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val 
545 550 555 560 

20 Gly Cys Ser Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly 

565 570 575 

Tyr His lie Gly Arg Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu 
580 585 590 

25 

Trp Leu Leu Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr 
595 600 605 

Lys lie Trp Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys 
30 610 615 620 

Glu Trp Arg Lys Thr Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly 
625 630 635 640 

35 Leu Leu Val Gly Met Asp Val Leu Thr Leu Ala lie Trp Gin lie Val 

645 650 655 

Asp Pro Leu His Arg Thr lie Glu Thr Phe Ala Lys Glu Glu Pro Lys 
660 665 670 

40 

Glu Asp lie Asp Val Ser lie Leu Pro Gin Leu Glu His Cys^Ser Ser 
675 680 685 
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Arg Lys Met Asn Thr Trp Leu Gly lie Phe Tyr Gly Tyr Lys Gly Leu 
690 695 700 

Leu Leu Leu Leu Gly lie Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser 
705 710 715 720 

Thr Glu Lys He Asn Asp His Arg Ala Val Gly Met Ala He Tyr Asn 
725 730 735 



Val Ala Val Leu Cys Leu He Thr 
740 

Ser Gin Gin Asp Ala Ala Phe Ala 
755 760 



Ala Pro Val Thr Met He Leu Ser 
745 750 

Phe Ala Ser Leu Ala He Val Phe 
765 



Ser Ser Tyr He Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg 
770 775 780 

Leu He Thr Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys 
785 790 795 800 

Thr Gly Ser Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu 
805 810 815 

Glu Lys Glu Asn Arg Glu Leu Glu Lys He He Ala Glu Lys Glu Glu 
820 825 830 

Arg Val Ser Glu Leu Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg 
835 840 845 

Ser Arg Arg His Pro Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro 
850 855 860 

Arg Gly Pro Pro Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg 
865 870 875 880 

Val His Leu Leu Tyr Lys 
885 



(2) INFORMATION FOR SEQ ID NO: 78: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 1692 base pairs 

(B) TYPE: nucleic acid 

(C) STRAND EDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA to mRNA 



(iii) HYPOTHETICAL: NO 



10 <vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME/ KEY : CDS 
15 (B) LOCATION : 1 . .1692 



<xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 



ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
20 Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
25 20 25 30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

30 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

35 TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 
Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 2 88 

40 Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 

85 90 95 

GTC CGA ATC TGC TCC AAG TCT TAT TTG ACC ATT ACC GAC CAA ATC TAC 336 
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Val Arg lie Cys Ser Lys Ser Tyr Leu Thr lie Thr Asp Gin lie Tyr 
100 105 110 

CGG GCA ATG AAC TCT TCG TCC TTT GAG GGT GTC TCT GGC CAT GTG GTG 3 84 

5 Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val 
115 120 125 

TTT GAT GCC AGC GGC TCT CGG ATG GCA TGG ACG CTT ATC GAG CAG CTT 432 
Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu lie Glu Gin Leu 
10 130 135 140 

CAG GGT GGC AGC TAC AAG AAG ATT GGC TAC TAT GAC AGC ACC AAG GAT 480 
Gin Gly Gly Ser Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp 
145 150 155 160 

15 

GAT CTT TCC TGG TCC AAA ACA GAT AAA TGG ATT GGA GGG TCC CCC CCA 528 
Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp lie Gly Gly Ser Pro Pro 
165 170 175 

20 GCT GAC CAG ACC CTG GTC ATC AAG ACA TTC CGC TTC CTG TCA CAG AAA 576 
Ala Asp Gin Thr Leu Val lie Lys Thr Phe Arg Phe Leu Ser Gin Lys 
180 185 190 

CTC TTT ATC TCC GTC TCA GTT CTC TCC AGC CTG GGC ATT GTC CTA GCT 624 
25 Leu Phe lie Ser Val Ser Val Leu Ser Ser Leu Gly lie Val Leu Ala 
195 200 205 

GTT GTC TGT CTG TCC TTT AAC ATC TAC AAC TCA CAT GTC CGT TAT ATC 672 
Val Val Cys Leu Ser Phe Asn lie Tyr Asn Ser His Val Arg Tyr lie 
210 215 220 



30 



35 



40 



CAG AAC TCA CAG CCC AAC CTG AAC AAC CTG ACT GCT GTG GGC TGC TCA 720 
Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser 
225 230 235 240 

CTG GCT TTA GCT GCT GTC TTC CCC CTG GGG CTC GAT GGT TAC CAC ATT 768 
Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His lie 
245 250 255 

GGG AGG AAC CAG TTT CCT TTC GTC TGC CAG GCC CGC CTC TGG CTC CTG 816 ;j 
Gly Arg Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu 
260 265 270* 
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GGC CTG GGC TTT AGT CTG 
Gly Leu Gly Phe Ser Leu 
275 



GGC TAC GGT TCC ATG TTC 
Gly Tyr Gly Ser Met Phe 
280 



PCT/SE98/01947 

ACC AAG ATT TGG SF4 

Thr Lys lie Trp 

285 



TGG GTC CAC ACG GTC TTC ACA AAG AAG GAA GAA AAG AAG GAG TGG AGG 912 
Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg 
290 295 300 

AAG ACT CTG GAA CCC TGG AAG CTG TAT GCC ACA GTG GGC CTG CTG GTG 960 
Lys Thr Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val 
305 310 315 320 

GGC ATG GAT GTC CTC ACT CTC GCC ATC TGG CAG ATC GTG GAC CCT CTG 1008 
Gly Met Asp Val Leu Thr Leu Ala He Trp Gin He Val Asp Pro Leu 
325 330 335 

CAC CGG ACC ATT GAG ACA TTT GCC AAG GAG GAA CCT AAG GAA GAT ATT 1056 
His Arg Thr He Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp He 
340 345 350 

GAC GTC TCT ATT CTG CCC CAG CTG GAG CAT TGC AGC TCC AGG AAG ATG 1104 
Asp Val Ser He Leu Pro Gin Leu Glu His Cys Ser Ser Arg Lys Met 
355 360 365 

AAT ACA TGG CTT GGC ATT TTC TAT GGT TAC AAG GGG CTG CTG CTG CTG 1152 
Asn Thr Trp Leu Gly He Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu 
370 375 380 

CTG GGA ATC TTC CTT GCT TAT GAG ACC AAG AGT GTG TCC ACT GAG AAG 1200 
Leu Gly He Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys 
385 390 395 400 

ATC AAT GAT CAC CGG GCT GTG GGC ATG GCT ATC TAC AAT GTG GCA GTC 1248 
He Asn Asp His Arg Ala Val Gly Met Ala He Tyr Asn Val Ala Val 
405 410 415 

CTG TGC CTC ATC ACT GCT CCT GTC ACC ATG ATT CTG TCC AGC CAG CAG 1296 
Leu Cys Leu He Thr Ala Pro Val Thr Met He Leu Ser Ser Gin Gin 
420 425 430 

•J 

GAT GCA GCC TTT GCC TTT GCC TCT CTT GCC ATA GTT TTC TCC TCC TAT 1344 
Asp Ala Ala Phe Ala Phe Ala Ser Leu Ala He Val Phe Ser Ser Tyr 
435 440 445 
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ATC ACT CTT GTT GTG CTC TTT GTG CCC AAG ATG CGC AGG CTG ATC ACC 1392 
lie Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg Leu lie Thr 
450 455 460 

CGA GGG GAA TGG CAG TCG GAG GCG CAG GAC ACC ATG AAG ACA GGG TCA 1440 
Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly Ser 
465 470 475 480 

TCG ACC AAC AAC AAC GAG GAG GAG AAG TCC CGG CTG TTG GAG AAG GAG 1488 
Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu 
485 490 495 

AAC CGT GAA CTG GAA AAG ATC ATT GCT GAG AAA GAG GAG CGT GTC TCT 153 6 

Asn Arg Glu Leu Glu Lys lie lie Ala Glu Lys Glu Glu Arg Val Ser 
500 505 510 

GAA CTG CGC CAT CAA CTC CAG TCT CGG CAG CAG CTC CGC TCC CGG CGC 1584 
Glu Leu Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg 
515 520 525 

CAC CCA CCG ACA CCC CCA GAA CCC TCT GGG GGC CTG CCC AGG GGA CCC 1632 
His Pro Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro Arg Gly Pro 
530 535 540 

CCT GAG CCC CCC GAC CGG CTT AGC TGT GAT GGG AGT CGA GTG CAT TTG 1680 
Pro Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu 
545 550 555 560 

CTT TAT AAG TGA ±692 
Leu Tyr Lys * 



( 2 ) INFORMATION FOR SEQ ID NO : 7 9: 

<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 563 amino acids 

(B) TYPE : amino acid 
(D) TOPOLOGY : linear 

(ii) MOLECULE TYPE: protein 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Glu 
50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

Val Arg He Cys Ser Lys Ser Tyr Leu Thr He Thr Asp Gin He Tyr 
100 105 HO 

Arg Ala Met Asn Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val 
115 120 125 

Phe Asp Ala Ser Gly Ser Arg Met Ala Trp Thr Leu He Glu Gin Leu 
130 135 140 

Gin Gly Gly Ser Tyr Lys Lys He Gly Tyr Tyr Asp Ser Thr Lys Asp 
145 150 155 160 

Asp Leu Ser Trp Ser Lys Thr Asp Lys Trp He Gly Gly Ser Pro Pro 
165 170 175 

Ala Asp Gin Thr Leu Val He Lys Thr Phe Arg Phe Leu Ser Gin Lys 
180 185 190 

Leu Phe He Ser Val Ser Val Leu Ser Ser Leu Gly lie Val Leu Ala 
195 200 205 

Val Val Cys Leu Ser Phe Asn He Tyr Asn Ser His Val Arg Tyr He 
210 215 220 
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Gin Asn Ser Gin Pro Asn Leu Asn Asn Leu Thr Ala Val Gly Cys Ser 
225 230 235 240 

5 Leu Ala Leu Ala Ala Val Phe Pro Leu Gly Leu Asp Gly Tyr His lie 

245 250 255 

Gly Arg Asn Gin Phe Pro Phe Val Cys Gin Ala Arg Leu Trp Leu Leu 
260 265 270 

10 

Gly Leu Gly Phe Ser Leu Gly Tyr Gly Ser Met Phe Thr Lys lie Trp 
275 280 285 

Trp Val His Thr Val Phe Thr Lys Lys Glu Glu Lys Lys Glu Trp Arg 
15 290 295 300 

Lys Thr Leu Glu Pro Trp Lys Leu Tyr Ala Thr Val Gly Leu Leu Val 
305 310 315 320 

20 Gly Met Asp Val Leu Thr Leu Ala He Trp Gin He Val Asp Pro Leu 

325 330 335 

His Arg Thr He Glu Thr Phe Ala Lys Glu Glu Pro Lys Glu Asp He 
340 345 350 

25 

Asp Val Ser He Leu Pro Gin Leu Glu His Cys Ser Ser Arg Lys Met 
355 360 365 

Asn Thr Trp Leu Gly He Phe Tyr Gly Tyr Lys Gly Leu Leu Leu Leu 
30 3 7 0 3 7 5 3 8 0 

Leu Gly He Phe Leu Ala Tyr Glu Thr Lys Ser Val Ser Thr Glu Lys 
385 390 395 400 

35 He Asn Asp His Arg Ala Val Gly Met Ala He Tyr Asn Val Ala Val 

405 410 415 

Leu Cys Leu He Thr Ala Pro Val Thr Met He Leu Ser Ser Gin Gin 
420 425 430 

40 

Asp Ala Ala Phe Ala Phe Ala Ser Leu Ala He Val Phe Ser Ser Tyr 
435 440 445 
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lie Thr Leu Val Val Leu Phe Val Pro Lys Met Arg Arg Leu lie Thr ~ 
450 455 460 

Arg Gly Glu Trp Gin Ser Glu Ala Gin Asp Thr Met Lys Thr Gly Ser 
465 470 475 480 

Ser Thr Asn Asn Asn Glu Glu Glu Lys Ser Arg Leu Leu Glu Lys Glu 
485 490 495 

Asn Arg Glu Leu Glu Lys lie lie Ala Glu Lys Glu Glu Arg Val Ser 

500 505 510 

Glu Leu Arg His Gin Leu Gin Ser Arg Gin Gin Leu Arg Ser Arg Arg 
515 520 525 

His Pro Pro Thr Pro Pro Glu Pro Ser Gly Gly Leu Pro Arg Gly Pro 
530 535 540 

Pro Glu Pro Pro Asp Arg Leu Ser Cys Asp Gly Ser Arg Val His Leu 
545 550 555 560 

Leu Tyr Lys 



(2) INFORMATION FOR SEQ ID NO: 80: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2602 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

<ix) FEATURE: 

(A) NAME/ KEY: CDS 

(B) LOCATION: 1. .318 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80: 



48 



ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
1 5 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin He 
20 25 30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
He His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly . Leu Thr Arg 
35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAT 192 
Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Asp 
50 55 60 

GAA TCG AAC GCC ACA CTC AGA ACG GCG CGC AGT GTA CAT CGG GGC ACT 240 
Glu Ser Asn Ala Thr Leu Arg Thr Ala Arg Ser Val His Arg Gly Thr 
65 70 75 80 

GTT TCC CAT GAG CGG GGG CTG GCC AGG GGG CCA GGC CTG CCA GCC CGC 2 88 

Val Ser His Glu Arg Gly Leu Ala Arg Gly Pro Gly Leu Pro Ala Arg 
85 90 95 

GGT GGA GAT GGC GCT GGA GGA CGT GAA TAG CCGCAGGGAC ATCCTGCCGG 33 8 

Gly Gly Asp Gly Ala Gly Gly Arg Glu * 
100 105 

ACTATGAGCT CAAGCTCATC CACCACGACA GCAAGTGTGA TCCAGGCCAA GCCACCAAGT 3 98 

ACCTATATGA GCTGCTCTAC AACGACCCTA TCAAGATCAT CCTTATGCCT GGCTGCAGCT 458 

CTGTCTCCAC GCTGGTGGCT GAGGC TGCTA GGATGTGGAA CCTCATTGTG CTTTCCTATG 518 

GCTCCAGCTC ACCAGCCCTG TCAAACCGGC AGCGTTTCCC CACTTTCTTC CGAACGCACC 578 

CATCAGCCAC ACTCCACAAC CCTACCCGCG TGAAACTCTT TGAAAAGTGG GGCTGGAAGA 63 8.^ 

AGATTGCTAC CATCCAGCAG ACCACTGAGG TCTTCACTTC GACTCTGGAC GACCTGGAGG 698 
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AACGAGTGAA GGAGGCTGGA ATTGAGATTA CTTTCCGCCA GAG TTTCTTC TCAGATCCAG 75"8 
CTGTGCCCGT CAAAAACCTG AAGCGCCAGG ATGCCCGAAT CATCGTGGGA CTTTTCTATG 818 
AGACTGAAGC CCGGAAAGTT TTTTGTGAGG TGTACAAGGA GCGTCTCTTT GGGAAGAAGT 878 
ACGTCTGGTT CCTCATTGGG TGGTATGCTG ACAATTGGTT CAAGATCTAC GACCCTTCTA 938 

TCAACTGCAC AGTGGATGAG ATGACTGAGG CGGTGGAGGG CCACATCACA ACTGAGATTG .998 

TCATGCTGAA TCCTGCCAAT ACCCGCAGCA TTTCCAACAT GACATCCCAG GAATTTGTGG 1058 

AGAAACTAAC CAAGCGACTG AAAAGACACC CTGAGGAGAC AGGAGGCTTC CAGGAGGCAC 1118 

CGCTGGCCTA TGATGCCATC TGGGCCTTGG CACTGGCCCT GAACAAGACA TCTGGAGGAG 1178 

GCGGCCGTTC TGGTGTGCGC CTGGAGGACT TCAACTACAA CAACCAGACC ATTACCGACC 123 8 



AAATCTACCG GGCAATGAAC TCTTCGTCCT TTGAGGGTGT CTCTGGCCAT GTG GTGTTTG 
ATGCCAGCGG CTCTCGGATG GCATGGACGC TTATCGAGCA GCTTCAGGGT GGCAGCTACA 



1298 
1358 



AGAAGATTGG CTACTATGAC AGCACCAAGG ATGATCTTTC CTGGTCCAAA ACAGATAAAT 1418 



GGATTGGAGG GTCCCCCCCA GCTGACCAGA CCCTGGTCAT CAAGACATTC CGCTTCCTGT 
CACAGAAACT CTTTATCTCC GTCTCAGTTC TCTCCAGCCT GGGCATTGTC CTAGCTGTTG 
TCTGTCTGTC CTTTAACATC TACAACTCAC ATGTCCGTTA TATCCAGAAC TCACAGCCCA 
ACCTGAACAA CCTGACTGCT GTGGGCTGCT CACTGGCTTT AGCTGCTGTC TTCCCCCTGG 



1478 
1538 
1598 
1658 



GGCTCGATGG TTACCACATT GGGAGGAACC AGTTTCCTTT CGTCTGCCAG GCCCGCCTCT 1718 



GGCTCCTGGG CCTGGGCTTT AGTCTGGGCT ACGGTTCCAT GTTCACCAAG ATTTGGTGGG 
TCCACACGGT CTTCACAAAG AAGGAAGAAA AGAAGGAGTG GAGGAAGACT CTGGAACCCT 
GGAAGCTGTA TGCCACAGTG GGCCTGCTGG TGGGCATGGA TGTCCTCACT CTCGCCATCT 
GGCAGATCGT GGACCCTCTG CACCGGACCA TTGAGACATT TGCCAAGGAG GAACCTAAGG 
AAGATATTGA CGTCTCTATT CTGCCCCAGC TGGAGCATTG CAGCTCCAGG AAGATGAATA 



1778 
1838 
1898 
1958 
2018 
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CATGGCTTGG CATTTTCTAT GGTTACAAGG GGCTGCTGCT GCTGCTGGGA ATCTTCCTTG 207 8 

CTTATGAGAC CAAGAGTGTG TCCACTGAGA AGATCAATGA TCACCGGGCT GTGGGCATGG 213 8 

CTATCTACAA TGTGGCAGTC CTGTGCCTCA TCACTGCTCC TGTCACCATG ATTCTGTCCA 2198 

GCCAGCAGGA TGCAGCCTTT GCCTTTGCCT CTCTTGCCAT AGTTTTCTCC TCCTATATCA 2258 

CTCTTGTTGT GCTCTTTGTG CCCAAGATGC GCAGGCTGAT CACCCGAGGG GAATGGCAGT 2318 

CGGAGGCGCA GGACACCATG AAGACAGGGT CATCGACCAA CAACAACGAG GAGGAGAAGT 2378 

CCCGGCTGTT GGAGAAGGAG AACCGTGAAC TGGAAAAGAT CATTGCTGAG AAAGAGGAGC 243 8 

GTGTCTCTGA ACTGCGCCAT CAACTCCAGT CTCGGCAGCA GCTCCGCTCC CGGCGCCACC 2498 

CACCGACACC CCCAGAACCC TCTGGGGGCC TGCCCAGGGG ACCCCCTGAG CCCCCCGACC 2558 

20 G GCTTAGCTG TGATGGGAGT CGAGTGCATT TGCTTTATAA GTGA 2602 

( 2 ) INFORMATION FOR SEQ ID NO : 81: 

25 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 105 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

30 (ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81: 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
35 1 5 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

40 lie His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Asp 
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Glu Ser Asn Ala Thr Leu Arg Thr Ala Arg Ser Val His Arg Gly Thr 

65 70 75 80 

Val Ser His Glu Arg Gly Leu Ala Arg Gly Pro Gly Leu Pro Ala Arg 

85 90 95 

Gly Gly Asp Gly Ala Gly Gly Arg Glu 

100 105 



(2) INFORMATION FOR SEQ ID NO : 82: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 67 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : single 

( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 

(A) NAME /KEY : CDS 

(B) LOCATION: 1. .294 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82: 

ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 
Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG * ACT CGG 
lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
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v. 

35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 
Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 
Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 288 
Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

GTG TGA TCCAGGCCAA GCCACCAAGT ACCTATATGA GCTGCTCTAC AACGACCCTA 344 
Val * 



TCAAGATCAT CCTTATGCCT GGCTGCAGCT CTGTCTCCAC GCTGGTGGCT GAGGCTGCTA 404 

GGATGTGGAA CCTCATTGTG CTTTCCTATG GCTCCAGCTC ACCAGCCCTG TCAAACCGGC 464 

AGCGTTTCCC CACTTTCTTC CGAACGCACC CATCAGCCAC ACTCCACAAC CCTACCCGCG 524 

TGAAACTCTT TGAAAAGTGG GGCTGGAAGA AGATTGCTAC CATC C AGC AG ACCACTGAGG 584 

TCTTCACTTC GACTCTGGAC GACCTGGAGG AACGAGTGAA GGAGGCTGGA ATTGAGATTA 644 

CTTTCCGCCA GAGTTTCTTC TCAGATCCAG CTGTGCCCGT CAAAAACCTG AAGCGCCAGG 704 

ATGCCCGAAT CATCGTGGGA CTTTTCTATG AGACTGAAGC CCGGAAAGTT TTTTGTGAGG 764 

TGTACAAGGA GCGTCTCTTT GGGAAGAAGT ACGTCTGGTT CCTCATTGGG TGGTATGCTG 824 

ACAATTGGTT CAAGATCTAC GACCCTTCTA TCAACTGCAC AGTGGATGAG ATGACTGAGG 884 

CGGTGGAGGG CCACATCACA ACTGAGATTG TCATGCTGAA TCCTGCCAAT ACCCGCAGCA 944 

TTTCCAACAT GACATCCCAG GAATTTGTGG AGAAACTAAC CAAGCGACTG AAAAGACACC 1004 

CTGAGGAGAC AGGAGGCTTC CAGGAGGCAC CGCTGGC CTA TGATGCCATC TGGGCCTTGG 1064 

CACTGGCCCT GAACAAGACA TCTGGAGGAG GCGGCCGTTC TGGTGTGCGC CTGGAGGACT 1124 
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TCAACTACAA CAACCAGACC ATTACCGACC AAATCTACCG GGCAATGAAC TCTTCGTCCT 1184 

TTGAGGGTGT CTCTGGCCAT GTGGTGTTTG ATGCCAGCGG CTCTCGGATG GCATGGACGC 1244 

TTATCGAGCA GCTTCAGGGT GGCAGCTACA AGAAGATTGG CTACTATGAC AGCACCAAGG 1304 

ATGATCTTTC CTGGTCCAAA ACAGATAAAT GGATTGTTAT ATCCAGAACT CACAGCCCAA 1364 



10 



CCTGAACAAC CTGACTGCTG TGGGCTGCTC ACTGGCTTTA GCTGCTGTCT TCCCCCTGGG 1424 



15 



GCTCGATGGT TACCACATTG GGAGGAACCA GTTTCCTTTC GTCTGCCAGG CCCGCCTCTG 1484 

GCTCCTGGGC CTGGGCTTTA GTCTGGGCTA CGGTTCCATG TTCACCAAGA TTTGGTGGGT 1544 

CCACACGGTC TTCACAAAGA AGGAAGAAAA GAAGGAGTGG AGGAAGACTC TGGAACCCTG 1604 

GAAGCTGTAT GCCACAGTGG GCCTGCTGGT GGGCATGGAT GTCCTCACTC TCGCCATCTG 1664 



20 



GCAGATCGTG GACCCTCTGC ACCGGACCAT TGAGACATTT GCCAAGGAGG AACCTAAGGA 1724 



25 



AGATATTGAC GTCTCTATTC TGCCCCAGCT GGAGCATTGC AGCTCCAGGA AGATGAATAC 1784 

ATGGCTTGGC ATTTTCTATG GTTACAAGGG GCTGCTGCTG CTGCTGGGAA TCTTCCTTGC 1844 

TTATGAGACC AAGAGTGTGT CCACTGAGAA GATCAATGAT CACCGGGCTG TGGGCATGGC 1904 

TATCTACAAT GTGGCAGTCC TGTGCCTCAT CACTGCTCCT GTCACCATGA TTCTGTCCAG 1964 



30 



CCAGCAGGAT GCAGCCTTTG CCTTTGCCTC TCTTGCCATA GTTTTCTCCT CCTATATCAC 2024 



35 



TCTTGTTGTG CTCTTTGTGC CCAAGATGCG CAGGCTGATC ACCCGAGGGG AATGGCAGTC 2084 

GGAGGCGCAG GACACCATGA AGACAGGGTC ATCGACCAAC AACAACGAGG AGGAGAAGTC 2144 

CCGGCTGTTG GAGAAGGAGA ACCGTGAACT GGAAAAGATC ATTGCTGAGA AAGAGGAGCG 2204 

TGTCTCTGAA CTGCGCCATC AACTCCAGTC TCGGCAGCAG CTCCGCTCCC GGCGCCACCC 2264 



40 ACCGACACCC CCAGAACCCT CTGGGGGCCT GCCCAGGGGA CCCCCTGAGC CCCCCGACCG 2324 r , 



GCTTAGCTGT GATGGGAGTC GAGTGCATTT GCTTTATAAG TGA 
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(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 97 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83: 

Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

He His Pro Pro Trp Glu Gly Gly He Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

Asp Gin Val Lys Ala He Asn Phe Leu Pro Val Asp Tyr Glu He Glu 
50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 



Val 



(2) INFORMATION FOR SEQ ID NO : 84: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2 489 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 
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(iii) HYPOTHETICAL; NO 
(iv) ANTI-SENSE: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(ix) FEATURE: 
10 (A) NAME / KEY : CDS 

(B) LOCATION :1. .1491 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84: 

15 ATG TTG CTG CTG CTG CTA CTG GCG CCA CTC TTC CTC CGC CCC CCG GGC 48 
Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

GCG GGC GGG GCG CAG ACC CCC AAC GCC ACC TCA GAA GGT TGC CAG ATC 96 
20 Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

ATA CAC CCG CCC TGG GAA GGG GGC ATC AGG TAC CGG GGC CTG ACT CGG 144 
lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
25 35 40 45 

GAC CAG GTG AAG GCT ATC AAC TTC CTG CCA GTG GAC TAT GAG ATT GAG 192 

Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
50 55 60 

30 

TAT GTG TGC CGG GGG GAG CGC GAG GTG GTG GGG CCC AAG GTC CGC AAG 240 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

35 TGC CTG GCC AAC GGC TCC TGG ACA GAT ATG GAC ACA CCC AGC CGC TGT 2 88 

Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 
85 90 95 

GTC CGA ATC TGC TCC AAG TCT TAT TTG ACC CTG GAA AAT GGG AAG GTT 336 
40 Val Arg lie Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val 
100 105 110 

TTC CTG ACG GGT GGG GAC CTC CCA GCT CTG GAC GGA GCC CGG GTG GAT 3 84 
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Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Asp 
115 120 125 

TTC CGG TGT GAC CCC GAC TTC CAT CTG TGT GAT CCA GGC CAA GCC ACC 432 
5 Phe Arg Cys Asp Pro Asp Phe His Leu Cys Asp Pro Gly Gin Ala Thr 
130 135 140 

AAG TAC CTA TAT GAG CTG CTC TAC AAC GAC CCT ATC AAG ATC ATC CTT 480 
Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu 
10 145 150 155 160 

ATG CCT GGC TGC AGC TCT GTC TCC ACG CTG GTG GCT GAG GCT GCT AGG 528 
Met Pro Gly Cys Ser Ser Val Ser Thr Leu Val Ala Glu Ala Ala Arg 
165 170 175 

15 

ATG TGG AAC CTC ATT GTG CTT TCC TAT GGC TCC AGC TCA CCA GCC CTG 576 
Met Trp Asn Leu He Val Leu Ser Tyr Gly Ser Ser Ser Pro Ala Leu 
180 185 190 

20 TCA AAC CGG CAG CGT TTC CCC ACT TTC TTC CGA ACG CAC CCA TCA GCC 624 
Ser Asn Arg Gin Arg Phe Pro "Thr Phe Phe Arg Thr His Pro Ser Ala 
195 200 205 

ACA CTC CAC AAC CCT ACC CGC GTG AAA CTC TTT GAA AAG TGG GGC TGG 672 
25 Thr Leu His Asn Pro Thr Arg Val Lys Leu Phe Glu Lys Trp Gly Trp 
210 215 220 

AAG AAG ATT GCT ACC ATC CAG CAG ACC ACT GAG GTC TTC ACT TCG ACT 720 
Lys Lys He Ala Thr He Gin Gin Thr Thr Glu Val Phe Thr Ser Thr 
30 225 230 235 240 

CTG GAC GAC CTG GAG GAA CGA GTG AAG GAG GCT GGA ATT GAG ATT ACT 768 
Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly He Glu He Thr 
245 250 255 



35 



40 



TTC CGC CAG AGT TTC TTC TCA GAT CCA GCT GTG CCC GTC AAA AAC CTG 816 
Phe Arg Gin Ser Phe Phe Ser Asp Pro Ala Val Pro Val Lys Asn Leu 
260 265 270 

AAG CGC CAG GAT GCC CGA ATC ATC GTG GGA CTT TTC TAT GAG ACT GAA 864 o 

Lys Arg Gin Asp Ala Arg He lie Val Gly Leu Phe Tyr Glu Thr Glu 
275 280 285 
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GCC CGG AAA GTT TTT TGT GAG GTG TAG AAG GAG CGT CTC TTT GGG AAG 912 
Ala Arg Lys Val Phe Cys Glu Val Tyr Lys Glu Arg Leu Phe Gly Lys 
290 295 300 

AAG TAC GTC TGG TTC CTC ATT GGG TGG TAT GCT GAC AAT TGG TTC AAG 960 
Lys Tyr Val Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys 
305 310 315 320 

ATC TAC GAC CCT TCT ATC AAC TGC ACA GTG GAT GAG ATG ACT GAG GCG 1008 
He Tyr Asp Pro Ser He Asn Cys Thr Val Asp Glu Met Thr Glu Ala 
325 330 335 

GTG GAG GGC CAC ATC ACA ACT GAG ATT GTC ATG CTG AAT CCT GCC AAT 1056 
Val Glu Gly His He Thr Thr Glu He Val Met Leu Asn Pro Ala Asn 
340 345 350 

ACC CGC AGC ATT TCC AAC ATG ACA TCC CAG GAA TTT GTG GAG AAA CTA 1104 
Thr Arg Ser He Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu 
355 360 365 

ACC AAG CGA CTG AAA AGA CAC CCT GAG GAG ACA GGA GGC TTC CAG GAG 1152 
Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu 
370 375 380 

GCA CCG CTG GCC TAT GAT GCC ATC TGG GCC TTG GCA CTG GCC CTG AAC 1200 
Ala Pro Leu Ala Tyr Asp Ala He Trp Ala Leu Ala Leu Ala Leu Asn 
385 390 395 400 

AAG ACA TCT GGA GGA GGC GGC CGT TCT GGT GTG CGC CTG GAG GAC TTC 1248 
Lys Thr Ser Gly Gly Gly Gly Arg Ser Gly Val Arg Leu Glu Asp Phe 
405 410 415 

AAC TAC AAC AAC CAG ACC ATT ACC GAC CAA ATC TAC CGG GCA ATG AAC 1296 
Asn Tyr Asn Asn Gin Thr He Thr Asp Gin He Tyr Arg Ala Met Asn 
420 425 430 

TCT TCG TCC TTT GAG GGT GTC TCT GGC CAT GTG GTG TTT GAT GCC AGC 1344 
Ser Ser Ser Phe Glu Gly Val Ser Gly His Val Val Phe Asp Ala Ser 
435 440 445 

i> 

GGC TCT CGG ATG GCA TGG ACG CTT ATC GAG CAG CTT CAG GGT GGC AGC 13 92 

Gly Ser Arg Met Ala Trp Thr Leu He Glu Gin Leu Gin Gly Gly Ser 
450 455 460 
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TAC AAG AAG ATT GGC TAC TAT GAC AGC ACC AAG GAT GAT CTT TCC TGG 
Tyr Lys Lys lie Gly Tyr Tyr Asp Ser Thr Lys Asp Asp Leu Ser Trp 
465 470 475 480 



1440 



TCC AAA ACA GAT AAA TGG ATT GTT ATA TCC AGA ACT CAC AGC CCA ACC 
Ser Lys Thr Asp Lys Trp lie Val lie Ser Arg Thr His Ser Pro Thr 
485 490 495 



1488 



TGA ACAACCTGAC TGCTGTGGGC TGCTCACTGG CTTTAGCTGC TGTCTTCCCC 



1541 



CTGGGGCTCG ATGGTTACCA CATTGGGAGG AACCAGTTTC CTTTCGTCTG CCAGGCCCGC 1601 

CTCTGGCTCC TGGGCCTGGG CTTTAGTCTG GGCTACGGTT C CATGTTC AC CAAGATTTGG 1661 

TGGGTCCACA CGGTCTTCAC AAAGAAGGAA GAAAAGAAGG AGTGGAGGAA GACTCTGGAA 1721 

CCCTGGAAGC TGTATGCCAC AGTGGGCCTG CTGGTGGGCA TGGATGTCCT CACTCTCGCC 1781 

ATCTGGCAGA TCGTGGACCC TCTGCACCGG ACCATTGAGA CATTTGCCAA GGAGGAACCT 1841 

AAGGAAGATA TTGACGTCTC TATTCTGCCC CAGCTGGAGC ATTGCAGCTC CAGGAAGATG 1901 

AATACATGGC TTGGCATTTT CTATGGTTAC AAGGGGC TGC TGCTGCTGCT GGGAATCTTC 1961 

CTTGCTTATG AGACCAAGAG TGTGTCCACT GAGAAGATCA ATGATCACCG GGCTGTGGGC 2021 

ATGGCTATCT ACAATGTGGC AGTCCTGTGC CTCATCACTG CTCCTGTCAC CATGATTCTG 2081 

TCCAGCCAGC AGGATGCAGC CTTTGCCTTT GCCTCTCTTG CCATAGTTTT CTCCTCCTAT 2141 

ATCACTCTTG TTGTGCTCTT TGTGCCCAAG ATGCGCAGGC TGATCACCCG AGGGGAATGG 2201 

CAGTCGGAGG CGCAGGACAC CATGAAGACA GGGTCATCGA CCAACAACAA CGAGGAGGAG 2261 

AAGTCCCGGC TGTTGGAGAA GGAGAACCGT GAACTGGAAA AGATCATTGC TGAGAAAGAG 2321 

GAGCGTGTCT CTGAACTGCG CCATCAACTC CAGTCTCGGC AGCAGCTCCG CTCCCGGCGC 23 81 



CACCCACCGA CACCCCCAGA ACCCTCTGGG GGCCTGCCCA GGGGACCCCC TGAGCCCCCC 2441 
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GACCGGCTTA GCTGTGATGG GAGTCGAGTG CATTTGCTTT ATAAGTGA 2*B9 



(2) INFORMATION FOR SEQ ID NO: 85: 

5 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 496 amino acids 

(B) TYPE: amino acid 
( D ) TOPOLOGY : 1 inear 

10 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 

!5 Met Leu Leu Leu Leu Leu Leu Ala Pro Leu Phe Leu Arg Pro Pro Gly 
15 10 15 

Ala Gly Gly Ala Gin Thr Pro Asn Ala Thr Ser Glu Gly Cys Gin lie 
20 25 30 

20 

lie His Pro Pro Trp Glu Gly Gly lie Arg Tyr Arg Gly Leu Thr Arg 
35 40 45 

Asp Gin Val Lys Ala lie Asn Phe Leu Pro Val Asp Tyr Glu lie Glu 
25 50 55 60 

Tyr Val Cys Arg Gly Glu Arg Glu Val Val Gly Pro Lys Val Arg Lys 
65 70 75 80 

30 Cys Leu Ala Asn Gly Ser Trp Thr Asp Met Asp Thr Pro Ser Arg Cys 

85 90 95 

Val Arg lie Cys Ser Lys Ser Tyr Leu Thr Leu Glu Asn Gly Lys Val 
100 105 110 

35 

Phe Leu Thr Gly Gly Asp Leu Pro Ala Leu Asp Gly Ala Arg Val Asp 
115 120 125 

Phe Arg Cys Asp Pro Asp Phe His Leu Cys Asp Pro Gly Gin Ala Thr 
40 130 135 140 

Lys Tyr Leu Tyr Glu Leu Leu Tyr Asn Asp Pro lie Lys lie lie Leu 
145 150 155 160 
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Met Pro Gly Cys Ser Ser Val Ser 
165 

5 Met Trp Asn Leu lie Val Leu Ser 
180 

Ser Asn Arg Gin Arg Phe Pro Thr 
195 200 

10 

Thr Leu His Asn Pro Thr Arg Val 
210 215 



Thr Leu Val Ala Glu Ala Ala Arg 
170 175 

Tyr Gly Ser Ser Ser Pro Ala Leu 
185 190 

Phe Phe Arg Thr His Pro Ser Ala 
205 

Lys Leu Phe Glu Lys Trp Gly Trp 
220 



Lys Lys lie Ala Thr lie Gin Gin Thr Thr Glu Val Phe Thr Ser Thr 
15 225 230 235 240 

Leu Asp Asp Leu Glu Glu Arg Val Lys Glu Ala Gly lie Glu lie Thr 

245 250 255 



20 Phe Arg Gin Ser Phe Phe Ser Asp 
260 

Lys Arg Gin Asp Ala Arg lie lie 
275 280 

25 

Ala Arg Lys Val Phe Cys Glu Val 
290 295 



Pro Ala Val Pro Val Lys Asn Leu 
265 270 

Val Gly Leu Phe Tyr Glu Thr Glu 
285 

Tyr Lys Glu Arg Leu Phe Gly Lys 
300 



Lys Tyr Val Trp Phe Leu lie Gly Trp Tyr Ala Asp Asn Trp Phe Lys 
30 305 310 315 320 

lie Tyr Asp Pro Ser lie Asn Cys Thr Val Asp Glu Met Thr Glu Ala 
325 330 335 

35 Val Glu Gly His lie Thr Thr Glu He Val Met Leu Asn Pro Ala Asn 
340 345 350 



Thr Arg Ser lie Ser Asn Met Thr Ser Gin Glu Phe Val Glu Lys Leu 
355 360 365 

40 

Thr Lys Arg Leu Lys Arg His Pro Glu Glu Thr Gly Gly Phe Gin Glu 
370 375 380 
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Ala Pro Leu Ala 
385 

Lys Thr Ser Gly 



Asn Tyr Asn Asn 
420 

Ser Ser Ser Phe 
435 

Gly Ser Arg Met 
450 

Tyr Lys Lys lie 
465 

Ser Lys Thr Asp 



Tyr Asp Ala lie 
390 

Gly Gly Gly Arg 
405 

Gin Thr lie Thr 



Glu Gly Val Ser 
440 

Ala Trp Thr Leu 
455 

Gly Tyr Tyr Asp 
470 

Lys Trp lie Val 
485 



Trp Ala Leu Ala 
395 

Ser Gly Val Arg 
410 

Asp Gin lie Tyr 
425 

Gly His Val Val 



lie Glu Gin Leu 
460 

Ser Thr Lys Asp 
475 

lie Ser Arg Thr 
490 



Leu Ala Leu Asn 
400 

Leu Glu Asp Phe 
415 

Arg Ala Met Asn 
430 

Phe Asp Ala Ser 
445 

Gin Gly Gly Ser 



Asp Leu Ser Trp 
480 

His Ser Pro Thr 
495 
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